Text Analysis Software and Spoken American Corpus

 

MonoConc Pro Version 2.0. (Demo version available at www.athel.com.)

Corpus of Spoken Professional American English, 2 million words with part-of-speech tags.

 

MonoConc Pro is widely used for teaching and research. The program has a well-designed user interface that allows novice users to load a text or corpus and start analysing language patterns with ease, and yet provides expert users with the powerful and configurable commands needed to work with large, complex, annotated texts.

Search

A corpus or text can be searched very quickly and easily. By choosing the basic Search command, the user can retrieve all examples of a word or phrase (or part of a word) from the corpus (one or more text files). Below, the search term speak has been entered. All instances of speak will be collected from the text-only files comprising the loaded corpus.


More complex search commands are also available when necessary, as the screen shot below reveals. Some of the possible search options are: Text search, Regular expression search, Tag (part-of-speech) search, Batch search, and various heading-sensitive and context-sensitive searches.

Results

The retrieved examples of the search term are typically displayed in a Keyword in Context format such that if the lines are sorted, patterns in the data can very easily be identified visually. (Once unwanted examples have been deleted, the results can be displayed and saved in a sentence format, with accompanying source file information if required.) Above the results window is a context window (reduced in size in the screen shot below). This context window provides up to 30K of context for each "hit." Various kinds of information relating to the results are displayed in the lower panels of this window.

 

 

Various frequency calculations can be performed, including collocate frequency, collocation frequency, as well as word frequency for the entire corpus (in frequency order or alphabetical order).

 

Corpora

MonoConc Pro will work with any size of corpus and with different languages. Various features of the software make it easy to work with both mark-up and part-of-speech tags.

 

Pricing (Educational)

Single user: $85 (Includes paperback manual, program disk and sample corpus disk.)

Site licence: $550 (2-year licence for up to 15 users)

 

Corpus of Spoken Professional American English (CSPAE)

The Corpus of Spoken Professional American English includes transcripts of conversations of various types occurring between 1994 and 1998. The corpus consists primarily of short interchanges by approximately 400 speakers that are centred on professional activities broadly tied to academics and politics, including academic politics. The seventeen files comprising the corpus contain over 2 million words. For comparison the Brown Corpus and LOB Corpus, popular corpora from the sixties, each contain around 1 million words.

The corpus, which has been constructed from a selection of existing transcripts of interactions in professional settings, contains two sub-corpora of a million words each. One sub-corpus consists mainly of academic discussions such as faculty council meetings and committee meetings related to testing. The second sub-corpus contains transcripts of White House press conferences and is almost exclusively question-and-answer sessions. The corpus is suitable for lexical and grammatical investigations of spoken usage, but not for close discourse analysis since backchannel, overlap and pause length information is not available in the corpus.

Both tagged and untagged files are included in the corpus. The Part-of-Speech tags are similar to those used in the British National Corpus (BNC). MonoConc Pro will allow the tags to be suppressed for easier analysis once a search has been performed.

CSPAE is available for $79 ($295 for a site licence for up to 15 users.) The untagged version of the corpus is $49. ($179 for the site licence.)


Catalogue:

MonoConc Pro 2.2
Concordance/Text analysis software. Requirements: Windows 95/XP/Vista etc. $85 (Educational price) ($550 2-year Site Licence)

MonoConc 1.5
No longer supported. For a similar program, check out MonoConc Easy

Concordance/Text Analysis software suitable for use by language learners and casual users. Requirements: Windows 3.x/95/98/2000/NT; Disk space: 1MB; RAM: 16MB. 1999. $69 ($425 2-year Site Licence)

Learning with Corpora
Contains articles by a variety of authors on the use of corpora in linguistic research, language teaching and translation studies. Edited by Guy Aston. 2001. $24.95

Concordances in the Classroom
Using concrete examples, the authors Chris Tribble and Glyn Jones show how a concordancer can be used to significantly enrich language and literature classes. 1997. $21.95

Corpus of Spoken Professional American English
The basic, tagged version includes both tagged and untagged files. 2000. $79 ($295 Site licence). Untagged version. $49 (Site Licence $179)

Concordance pack I
MonoConc 1.5, Corpus (CSPAE – untagged files), and Concordances in the Classroom by Tribble & Jones. $129

Concordance pack II
MonoConc Pro 2.0, Corpus (CSPAE – both tagged and untagged files) and Learning with Corpora Edited by Guy Aston. $159

Order form: Educational pricing
Athelstan, 5925 Kirby Drive, Suite E 464, Houston TX 77005
Email: info@athel.com Web orders: www.athel.com
Guarantee. Items found to be unsatisfactory for any reason can be returned within 30 days for a full refund.
Please send the following:
__ MonoConc Pro 2.2 @ $85/$550* ____
__ MonoConc Easy @ $45/$290* ____
__ CSPAE (tagged + untagged) @ $79/$295* ____
__ CSPAE (untagged) @ $49/$179* ____
__ Learning with Corpora @ $24.95 ____
__ Concordances in the Classrm @ $21.95 ____
__ Concordance pack I
@ $129 ____
__ Concordance pack II
@ $159 ____
Shipping and Handling: $8 (via UPS) within US and Canada. (Additional items $4.) $10 1st item + $4 ea additional item for airmail outside North America;. Total: ______

Ship/Bill to:
________________________________________________
________________________________________________
________________________________________________
________________________________________________
________________________________________________

Payment:
P.O. Number ___________ Cheque number _____
Credit card payment
Visa/MC/Amex Number ____________________________
Exp. Date ________ Name _________________________
Credit Card Billing Zip _____________________________

*Site licence price (2-year licence for up to 15 users)

Counts as two items for shipping purposes

Athelstan

5925 Kirby Drive, Suite E 464

Houston, TX 77005

U.S.A.

 

Email: info@athel.com

Fax: (713) 524-1159

Phone: (713) 371-2107

Web: www.athel.com