The Bergen Corpus of London Teenage Language (COLT) is the first large English Corpus focusing on the speech of teenagers. It was collected in 1993 and consists of the spoken language of 13 to 17-year-old teenagers from different boroughs of London. The complete corpus, half a million words, has been orthographically transcribed and word-class tagged, and is a constituent of the British National Corpus (BNC).
The entire COLT Corpus is now available on the Internet. You can search for any word, collocation of words or letter combination by means of the TACTweb software. The search program can also show the distribution of an item in relation to factors such as age, sex, socioeconomic class, location etc.
From April 22th 1996 you will need a userid and password to access the whole corpus.
The CD-ROM version of COLT will be available from December 1996.
For more information, please contact:
Migle Miliauskaite, email firstname.lastname@example.org
Ingrida Strazdaite, email email@example.com