Recommended on-line tools for corpus linguistics and NLP
Click here for recommended downloadable tools for off-line use
Updated: 2006-02-06
Using Internet Explorer ?
Instal the free
on top, and experience the difference in
online concordancing. Here
is one reason why.
PK's online corpus search (concordancer + bigrams + error search; PICLE and comparable EAP/ESP corpora)
Mark Davies' BNC-based Variation in English Words and Phrases
<NEW !> The HongKong PolyU English Department Language Bank (choose discipline, language, native/learner written/spoken corpus type)
<NEW !> Index of HongKong PolyU's corpora (e.g. Linguistics; also applied linguistics dissertations divided into Intros, Reviews and Conclusions; MicroConcord Corpus Collections, and many more)
MICASE (Michigan corpus of spoken academic US English; with associated audio files)
<NEW !> John Milton's Word Neighbors (compare word sequences across a variety of corpora + links to online dictionarie and thesauruses; also integrated with MarkMyWords and CheckMyWords)
Web Concordancer: CLT, VLC (LOB, Brown, newspaper, pedagogic corpora, learner corpora)
Online KWIC Concordancer: 1. BLC; 2. Bigram Plus (Business Letter Corpus and more)
Mark Davies' BNC-based Variation in English Words and Phrases
<NEW !> Pete Whitelock's Just the Word (check and group general English collocations; also integrated with MarkMyWords and CheckMyWords)
BNC World – simple search (Extended info on search patterns)[BNC Users Ref Guide]
BNC World – full search (IFA LAN / IFA users only: contact me if necessary) [SARA Manual] [Annotation info]
Cobuild Concordance and Collocations Sampler [10-Step Intro – by James Thomas]
Polish: 1. IPI PAN; 2. PWN; 3. PELCRA PNC (test version)
Other languages: KWICionary (currently German only, with English potentially to be added)
Web & website(s) as corpus: 1. WebCorp; 2. WebConc; 3. cf. KwiCFinder; 4. <NEW !> Leeds CQP ("Internet corpora")
Webpage(s) as corpus: 1. TurboLingo; 2. WordList Generator (WebCorp)
GlossaNet (free linguistic survey of on-line newspapers)
> Advanced linguistic search engines:
Phrases in English (n-grams, phrase-frames and POS-grams from the BNC)
Retrieve Collocation by Exemplar (Dekang Lin's demo)
> Literary concordancing:
Corpus- and web-based lexical research findings
Rzeczpospolita and PWN: Słowa Tygodnia
The Word Spy (not corpus-driven but web-informed lexpionage)
WebCALL:
Corpus-informed:
Compleat Lexical Tutor: CLT
Virtual Language Centre: VLC
<NEW !> MarkMyWords and CheckMyWords (for writing teachers and students; beta test versions; not always available)
Web-based Interactive Language Learning (IWiLL) and IWiLL Collocation Explorer
TeleNex (Sampler only) (Resource for English Teachers in Hong Kong; incl. Pattern Finder concordancer)
Tim John's Kibbitzer pages (EAP writing students' problems solved with concordance data)
MICASE Kibbitzer (spoken EAP language explained with corpus data – lexical, syntactic or discoursal)
Other (e.g. CMC):
Nicenet (Internet Classroom Assistant) (web-based course management system)
QuizStar (create online quizzes, incl. multiple-answer multiple choice)
4shared.com (user-friendly file sharing; commercial)
Online text analysis
Statistical tools
Web Site for Statistical Computation (Richard Lowry's online book and tools)
T-Score, Mutual Information and Observed-Expected Calculator
On-line (demo) text/sentence annotation
Electronic lexica
WordNet (also integrated with CLT concordancers)
Net Dictionary (an adaptation of WordNet, with TTS support, also intergrated with the VLC concordancers)
Voycabulary (makes the words on any webpage into links so you can look them up with just a click)
Wordcount (visual frequency list – illustrates the Zipfian distribution)
MT
TTS
More?
Last update:
2006-02-06
Page
maintained by Przemysław
Kaszubski