Actions

Algoliterary Encounters: Difference between revisions

From Algolit

(Datasets)
(Datasets)
Line 22: Line 22:
  
 
==== Datasets ====
 
==== Datasets ====
* [[The datasets speak]]
+
* [[Many many words]]
 +
 
 +
* [[The Enron email archive]]
 +
* [[Common Crawl]] (used by GloVe): selection of urls (Constant, Maison du Livre...)
 +
* [[Google News]] (used by word2vec)
 +
* [[Learning from Deep Learning]] (from lib.gen.rus.ec) (.txt)
 +
* [[HG Wells personal dataset]] (from Gutenberg.org) (.txt)
 +
* Jules Verne (FR), Shakespeare (FR) -> download from Gutenberg & clean up
 +
* [[AnarchFem]] (from aaaaarg.fail) (.txt)
  
 
==== From words to numbers ====
 
==== From words to numbers ====

Revision as of 10:54, 24 October 2017


Start of the Algoliterary Encounters catalog.

General Introduction

Algoliterary works

Algoliterary explorations

A few outputs to see how it works


(Before: talking_about_machine_learning - exploring the vocabulary of machine learning textbooks in 7 stages with word2vec)

Parts of NN process

Datasets

From words to numbers

Different views on the data

Creating word embeddings using word2vec

Autonomous machine as inspection

Algoliterary Toolkit

Bibliography