This repository provided a home for orphaned data previously found on http://nlp.perseus.tufts.edu/lexicon/ (also http://static.perseus.tufts.edu/lexicon/)
Note that active management of this phase of the project ceased in 2011. A new phase was started at: http://www.dynamiclexicon.com
Treebank work may be found here The Ancient Greek and Latin Dependency Treebank with links to https://github.com/PerseusDL/treebank_data.
General Information
The Dynamic Lexicon is an NEH-funded project to automatically create bilingual dictionaries (Greek/English and Latin/English) using parallel texts - source texts in Greek or Latin aligned with their English translations - along with the syntactic data encoded in treebanks. From these raw materials, we can construct a lexical entry that illustrates how a word is used not simply in all of Greek or Latin literature, but in any subset of that collection (e.g., Greek drama or the works of Cicero).
Data The published form of the Dynamic Lexicon includes automatically generated lexical entries along with the underlying intermediate analysis used to generate them (including word-level alignments between source texts and their translations, and automatic morphological tagging and syntactic analysis for the Greek and Latin originals). All data is licensed under a Creative Commons Attribution-Sharealike license.
- Dynamic Lexicon entries. XML files for all automatically generated lexical entries. [Greek] [Latin]
- Automatically word-aligned parallel texts. [Greek] [Latin]
- Automatically aligned, tagged and parsed source texts. (via Dropbox) [Greek] [Latin]
- Treebank data. [Greek and Latin]
Publications Bamman, David, and Gregory Crane, "Measuring Historical Word Sense Variation," in: Proceedings of the 11th ACM/IEEE Joint Conference on Digital Libraries (JCDL 2011) [pdf]
Bamman, David, and Gregory Crane, "The Ancient Greek and Latin Dependency Treebanks," in: Caroline Sporleder, Antal van den Bosch and Kalliopi Zervanou (eds.), Language Technology for Cultural Heritage (Springer, 2011). [pdf]
Bamman, David, and Gregory Crane (2009), "Computational Linguistics and Classical Lexicography," Digital Humanities Quarterly 3.1 [html]
Bamman, David, and Gregory Crane (2008), "Building a Dynamic Lexicon from a Digital Library," Proceedings of the 8th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL 2008) (Pittsburgh) [preprint]
Acknowledgements Grants from the National Endowment for the Humanities (PR-50013- 08, "The Dynamic Lexicon: Cyberinfrastructure and the Automated Analysis of Historical Languages") and NEH/DOE/NERSC ("Large-Scale Learning and the Automatic Analysis of Historical Texts") provided support for this work. This research used resources of the National Energy Research Scientific Computing Center, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231.