...
Kilgarriff, Adam (2012): „Getting to know your corpus“. In: Petr Sojka, Ales Horák, Ivan Kopecek, Karel Pala (eds.): Text, Speech and Dialogue. 15th International Conference, TSD 2012, Brno, Czech Republic, September 3-7, 2012, Proceedings. Berlin / Heidelberg: Springer (Lecture Notes in Computer Science, 7499), 3–15.
Krefeld, Thomas / Lücke, Stephan / Von Ehrlich, Isabel (2014): „Digitalianistica. Die italienische Philologie unterwegs in die digital humanities“. In: Italienisch 72, 52-70.
Lemnitzer, Lothar / Zinsmeister, Heike (eds.) (2006): Korpuslinguistik. Eine Einführung. Tübingen: Narr.
...
Pusch, Claus D. (2014): „Les corpus romans contemporains“. In: Klump, A. / Kramer, J. / Willems, A. (eds.): Manuel des langues romanes. Berlin: De Gruyter (Manuals of Romance Linguistics, 1), 173-195.
Schmidt, Thomas / Wörner, Kai (eds.) (2012): Multilingual corpora and multilingual corpus analysis. Amsterdam / Philadelphia: Benjamins (Hamburg studies on multilingualism, 14).
...
Baroni, Marco / Bernardini, Silvia / Ferraresi, Adriano / Zanchetta, Eros (2009): „The WaCky Wide Web: A Collection of Very Large Linguistically Processed Web-Crawled Corpora“. In: Language Resources and Evaluation 43, 209–226.
Bacelar do Nascimento, M. F. (2000): "Corpus de Référence du Portugais Contemporain". In: Bilger, M. (ed.): Corpus, Méthodologie et Applications Linguistiques. Paris: H. Champion / Presses Universitaires de Perpignan, 25-30.
Bellini, Daniele / Schneider, Stefan (2006): „Spoken Italian online: The Banca dati dell'italiano parlato (BADIP)“. In: Bernhard Kettemann, Georg Marko (eds.): Planing, Gluing and Painting Corpora. Inside the Applied Corpus Linguist's Workshop. Frankfurt am Main: Lang, 13-26.
Cresti, Emanuela / Moneglia, Massimo (eds.) (2005): C-ORAL-ROM. Integrated Reference Corpora for Spoken Romance Languages. Amsterdam / Philadelphia: Benjamins.
De Mauro, Tullio / Mancini, F. / Vedovelli, M. / Voghera, M. (1993): Lessico di frequenza dell’italiano parlato. Milan: ETAS.
Durand, Jacques / Laks, Bernard / Lyche, Chantal (2002): La phonologie du français contemporain: usages, variétés et structure. In: Pusch, Claus / Raible, Wolfgang (eds.): Romanistische Korpuslinguistik - Korpora und gesprochene Sprache. Tübingen: Narr, 93-106.
Durand, Jacques / Laks, Bernard / Lyche, Chantal (2009): Le projet PFC: une source de données primaires structurées. In: Iid. (eds.): Phonologie, variation et accents du français. Paris: Hermès, 19-61.
Eisele, Andreas / Chen, Yu (2010): "MultiUN: A Multilingual Corpus from United Nation Documents". In: Tapias, Daniel et al. (eds.): Proceedings of the Seventh conference on International Language Resources and Evaluation. Valletta, Malta: European Language Resources Association (ELRA), 2868-2872.
Généreux, Michel / Hendrickx, Iris / Mendes, Amália (2012): "Introducing the Reference Corpus of Contemporary Portuguese On-Line". In: Proceedings of the Eighth International Conference on Language Resources and Evaluation - LREC 2012, Istanbul, 21-27 May 2012, 2237-2244.
Kaminskaia, Svetlana / Poiré, François (2012): „Prosodie du français laurentien en milieu minoritaire : le corpus Windsor“. In: Anne Catherine Simon (ed.): La variation prosodique régionale en français. Louvain-La-Neuve: De Boeck (Champs linguistiques: Recueils), 159–178.
Koehn, Philipp (2005): "Europarl: A Parallel Corpus for Statistical Machine Translation". In: MT Summit 2005.
MacWhinney, B. (1994): The CHILDES Project: Tools for analyzing Talk. Hillsdale, NJ: Lawrence Erlbaum Associates.
Pusch, Claus D. (2000): „Das Corpus Occitano-Gascon als Beispiel multimedialer Sprachdatenaufbereitung“. In: Rieger, A. (ed.): Okzitanistik, Altokzitanistik und Provenzalistik. Geschichte und Auftrag einer europäischen Philologie. Frankfurt a. M. et al.: Lang, 43-56.
Reese, Samuel et al. (2010): "Wikicorpus: A Word-Sense Disambiguated Multilingual Wikipedia Corpus". In: Proceedings of 7th Language Resources and Evaluation Conference (LREC'10). Valletta, Malta.
Rossini Favretti, Rema (1998): "Using multilingual parallel corpora for the analysis of legal language: the Bononia Legal Corpus". In: Teubert, W. / Tognini Bonelli, E. / Volz, N. (eds.): Translation Equivalence. Proceedings of the Third European Seminar. The TELRI Association e. V. / Institut für Deutsche Sprache / The Tuscan Word Centre, 57-68.
Rossini Favretti, R. / Tamburini, F. / Martelli, E. (2007): "Words from Bononia Legal Corpus". In: Teubert, W. (ed.): Text Corpora and Multilingual Lexicography. John Benjamins, 11-30.
Schäfer, Roland (2015): "Processing and querying large web corpora with the COW14 architecture". In: Proceedings of Challenges in the Management of Large Corpora (CMLC-3).
Wäschle, K. / Simianer, P. / Bertoldi, N. / Riezler, S. / Federico, M. (2013): "Generative and Discriminative Methods for Online Adaptation in SMT". In: Proceedings of Machine Translation Summit XIV. Nice, France. (PatTR corpus)
Zampieri, M. / Becker, M. (2013): Colonia: Corpus of Historical Portuguese. In: ZSM Studien, Special Volume on Non-Standard Data Sources in Corpus-Based Research. Vol. 5. Shaker.
Corpora and spoken language
...
Carruthers, Janice (2008): „Annotating an oral corpus using the Text Encoding Initiative. Methodology, problems, solutions“. In: Journal of French Language Studies 18/1, 103–119.
Durand, Jacques / Laks, Bernard / Lyche, Chantal (2002): La phonologie du français contemporain: usages, variétés et structure. In: Pusch, Claus / Raible, Wolfgang (eds.): Romanistische Korpuslinguistik - Korpora und gesprochene Sprache. Tübingen: Narr, 93-106.
MacWhinney, B. (1994): The CHILDES Project: Tools for analyzing Talk. Hillsdale, NJ: Lawrence Erlbaum Associates.
...