这是indexloc提供的服务,不要输入任何密码
Skip to content

andre-geldenhuis/Colenso

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Colenso

Repo for Colenso Hackfest 2015

I will write a better description of the several data files at the weekend, but here already some data to play with.

I shall also add more data over the weekend and continue to extract material.

Links for additional wrangled Data

Result Topic-Modelling with R, Colenso Letters: https://www.google.com/fusiontables/DataSource?docid=1thLcTnppgdspjmawxVStMus18_vs7g9Ec29S3LRF

Topic Terms: https://docs.google.com/spreadsheets/d/1NYQIsqCsFIegGmqnMGb5LXFGoG59Epr2z8M1vsesRqw/edit?usp=sharing

Extracts from several PDFs

Specimen Collectors: https://docs.google.com/spreadsheets/d/1PtIGU99uQ6sx9uMlgkIM-XGYW6-vfgwoPdofOzyfiIc/edit?usp=sharing

Names mentioned in Colenso's work:https://docs.google.com/spreadsheets/d/1mImx0khy9Eqmhg9DqU1AdDqpa7ag6LtJKZpzuoVUcCA/edit?usp=sharing

List of Letters and Lists: https://docs.google.com/spreadsheets/d/1sbR2vrETM7jbzlox0GfNWfgzebWfjKu_U5Bc-zxLP08/edit?usp=sharing

Colenso's Travels I: https://docs.google.com/spreadsheets/d/13V01rs-6-zQezvYXTSMH2m1DV8ZEjDaywAVe_EBJkBw/edit?usp=sharing

Colenso's Travels II: https://docs.google.com/spreadsheets/d/1kdBBDojQ6svbONCQNyR1yvxcdMOi7OMAnSKtv036V-Q/edit?usp=sharing

Cleaned Textual Data From PDFs

All of 1833 to 1852-1.txt

Colenso_Death_Newspaper.txt

LettersAndLists.txt

LettersToTheEditor.txt

Overview_letters from James Hector to Joseph Dalton Hooker between 1860 and 1898.txt

PlaceNames.txt

Transcriptions of 22 letters from James Hector to his wife Georgiana.txt

Other Extracts

Day and Wast 1836-1843

Not very clean Data

Part 1.doc

Part 1.docx

Part 1.pdf

Part 1.txt

Part 2.docx

Part 2.pdf

Part 2.txt

About

Repo for Colenso Hackfest 2015

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published