NetSets

WikiSchools

4,403 articles from Wikipedia for Schools, with the links between them.

Description
Download (2 MB)

WikiVitals

10,012 vital articles of Wikipedia, with the links between them and the words used in their summaries.

Description
Download (5 MB)

WikiHumans

1,014,428 articles of Wikipedia on humans with links between them and links to other articles.

Description
Download (95 MB)

WikiLinks

3,210,346 articles of Wikipedia, with the links between them and the words used in their summaries.

Description
Download (840 MB)

Openflights

3,097 airports with the daily number of flights between them.

Description
Download (< 1 MB)

Cinema

Graph between 88,440 movies and 44,586 actors.

Description
Download (1 MB)

20newsgroup

Graph between 11,314 documents (in 20 newsgroups) and 56,126 words.

Description
Download (5 MB)

WikiDataSets

Topical subsets of Wikidata, assembled using the WikiDataSets Python library.

Extracted from Wikidata in April 2020.

Please consider citing our paper if you find these datasets useful in your research.

Details of the project:

Labels dictionary

Labels of all Wikidata entities (April 15, 2020).

Download (2.4 GB)

Animals

Subgraph of Wikidata containing only animal species.

Description
Download (105 MB)

Companies

Subgraph of Wikidata containing only companies.

Description
Download (16 MB)

Countries

Subgraph of Wikidata containing only countries.

Description
Download (< 1 MB)

Films

Subgraph of Wikidata containing only films.

Description
Download (28 MB)

Humans

Subgraph of Wikidata containing only humans.

Description
Download (409 MB)

Contact

"{}{}@enst.fr".format(first_letter_of_surname, name)
Thomas Bonald
Nathan De Lara
Quentin Lutz
Armand Boschin