Directory Help
Search only in Corpus AnalysisSearch the Web  

Corpus Analysis
  Science > Social Sciences > Linguistics > Computational Linguistics > Corpus Analysis   Go to Directory Home  

Categories
Tools (2)
WordNet (4)
Web Pages
View in Google PageRank order               Viewing in alphabetical order
A Logical Approach to Computational Corpus Linguistics http://www.ling.gu.se/~lager/taglog.html
A 1996 thesis by Torbjörn Lager. Abstract available, as well as full text in PostScript and PDF formats.
American National Corpus http://americannationalcorpus.org/
Information about this freely available database of American English.
Centre for Corpus Research http://www.corpus.bham.ac.uk/
At the University of Birmingham, England. Information on programmes, research and available resources.
Centre for English Corpus Linguistics http://juppiter.fltr.ucl.ac.be/FLTR/GERM/ETAN/CECL/cecl.html
At the Catholic University of Leuven, this institute focuses on cross-linguistic corpora and learner corpora. Research, events, staff, publications.
Clitic climbing in electronic corpora http://tesina.galleus.com/
Thesis study by Kertes Gábor that analyses the phenomenon of clitic climbing or clitic promotion. [Parallel Spanish and English]
Corpus Encoding Standard http://www.cs.vassar.edu/CES/
Application of SGML to corpus encoding. Covers the standard and projects currently using it.
ELRA catalog of language resources http://catalog.elra.info/
Various language resources and evaluation packages in the field of Human Language Technology (HLT) are available at ELRA (European Language Resources Association). Distribution is taken care of by ELRA's operational body: ELDA.
Free online parallel corpus http://korpus.hiztegia.org
This website allows you to search online for words in Basque, Polish, English, French or Spanish, and displays results in all these languages, aligned by paragraph.
Hungarian National Corpus http://corpus.nytud.hu/mnsz/index_eng.html
More than 150 million Hungarian words, a model of Hungarian language of the 1990s. Free and extensive query system. [Hungarian, English]
International Journal of Corpus Linguistics http://www.benjamins.com/cgi-bin/t_seriesview.cgi?series=IJCL
A journal published twice a year, presenting articles from linguists, lexicographers and language engineers. Contents, abstracts, submission information.
LDC - Linguistic Data Consortium http://ldc.upenn.edu/
The Linguistic Data Consortium (LDC) creates, collects and distributes speech and text databases, annotated corpora, treebanks, lexicons and other linguistic resources for research, education and development.
MRC Psycholinguistic Database http://www.psych.rl.ac.uk/
Web access to a large database of linguistic and psycholinguistic (but not semantic) data derived from a variety of sources.
National Corpus of Polish http://nkjp.pl/
The National Corpus of Polish is a publicly available, large, balanced and linguistically annotated corpus of polish.
Shallow Processing of Large Corpora Workshop 2003 http://www.bultreebank.org/ProgramSProLaC03.html
Held at Lancaster University. Presented papers are available in PDF format.
SIGANN: ACL Special Interest Group for Annotation http://www.cs.vassar.edu/sigann/
A subgroup of the Association for Computational Linguistics (ACL), this group is concerned with all aspects of linguistic annotation of language resources (linguistic corpora), especially the advancement of interoperability. Sponsors the annual Linguistic Annotation Workshop (LAW).
SIGDAT: ACL Special Interest Group for linguistic data and corpus-based approaches to NLP http://www.aclweb.org/sigdat
A subgroup of the Association for Computational Linguistics (ACL) which focuses on corpus-based and statistical methods in Natural Language Processing. Organizes the EMNLP conference (Empirical Methods in NLP) and the WVLC (Workshop on Very Large Corpora).
SIGWAC: ACL Special Interest Group on Web as Corpus http://www.sigwac.org.uk/
A subgroup of the Association for Computational Linguistics (ACL) which promotes interest in the use of the Internet as a source of linguistic data, and as an object of study in its own right. Organizes the WAC workshops.

Help build the largest human-edited directory on the web.
Submit a Site - Open Directory Project - Become an Editor

Modified by Google - ©2009 Google
Advertise with Us - Jobs, Press, Cool Stuff...