UMR 8135 CNRS - INaLCO

ELAN-CorpA

ELAN-CorpA is an extended version of the software ELAN from the Max Planck Institute for Psycholinguistics of Nijmegen, Netherlands. It has been developed firstly by Coralie Villes, then Uyên-To Rabier and finally Mourad Aouini under the direction of Christian Chanard from the CNRS research laboratory LLACAN (UMR 8135 of CNRS), initially for the research program ANR CorpAfroAs (A Spoken Corpus for Afro-Asiatic Languages), coordinated by Amina Mettouchi.

This version contains an additional tab 'Interlinearize' which allows the management of a XML lexicon (of extension .eafl) to annotate the words of a selected tier. If not found into the lexicon, a parser presents the possible segmentations into morphemes of the word. Those morphemes are then annotated on 2 additional tiers (gloss and category), with the contents of the lexicon.

A second lexicon (Parse Lexicon, with an .eafp extension) containing the words segmented and annotated from previous texts can be created or extended to speed up the annotation process.

Additionally, the table showing the hits found by a search in the annotated texts can display 2 more columns, 'parent' and 'children' (in addition to 'before' and 'after'), which contains the parent and the children of a search element (for example the word in which a morpheme has been found, the gloss and the category of this morpheme), respectively. Note: This has been incorporated into ELAN since the 4.6.1 release.

In the grid display, there is the new option to choose between 'multiple tiers with symbolic association' and 'multiple tiers with symbolic subdivision', which allows the display of the children of a tier depending on its stereotype. Note: this has been incorporated into ELAN since the 4.6.1 release.

Since version 4.7.4, a new item can be found in the Search menu: CorpA Multiple files Search where searching can be done by using CorpA Query Language

In this 4.8.0 version, a new feature Group & Link will allow you to create 2 tables:
  • Groups in which you can create groups of annotations chosen from the annotation area
  • Links in which you can link a source made of annotations (or predefined groups) to a target made of other annotations (or predefined groups)
These groups and links may each have a type (taken from a controlled vocabulary) and a name (label) to distinguish them. Then from the CorpA Multiple files Search module, you will be able to filter a request on the base of constraints like 'exist in a group', 'not exist in a group','exist in a source', 'not exist in a source', 'exist in a target', 'not exist in a target'.

Useful docs and tools:

  ELAN-CorpA   (click to download ELAN-CorpA)  

Windows:
  • After downloading, double-click install.exe
  • You do not need to install any other software. A Java virtual machine is included with this download.
Mac OS X:
  • After downloading, ELAN-CorpA.dmg should open automatically. Double-click on the install icon.
  • If you encounter a 'damaged file' issue when installing ELAN-CorpA, go there to learn how to allow the installation of a software not downloaded from Mac App Store (Apple menu > System Preferences… > Security & Privacy > General tab under the header "Allow applications downloaded from: Anywhere". After the installation, you may return to this place and change to the default value.)

    Once ELAN-CorpA is installed on a Mac, the program may hang up when loading the media file you chose to annotate ('Initializing...' problem) or when opening the media file of an ELAN file ('No player...' problem). To fix this: Go to Applications Folder, right-click on ELAN-CorpA_xxx, select Get Info, check the box run on 32 bits.

  • If it is not already on the desktop, make a shortcut of the ELAN-CorpA_xxx.jar and drag it to the desktop.
    Now use this shortcut to launch ELAN-CorpA.

You will find here a PDF document on how to use ELAN-CorpA

You will find here ( PC / Mac ) an ELAN template file Corpo1.etf that can be imported into ELAN to set up the types and tiers according to the CorpAfroAs model for one speaker, and here ( PC / Mac ) Corpo2.etf for two speakers. If the contents of those files open in your browser when downloading, just save them (file save as) on your computer.
To create a new ELAN file according to the CorpAfroAs template:

  • File, New, Browse to your Wav file, >> , ok
  • Delete the default tier (right-click on its label, delete default tier)
  • Tier, import Tiers
  • Browse to the downloaded file (Corpo1.etf for one speaker, Corpo2.etf for 2 speakers)
  • Import, Close
Windows: Mac:
ELAN-CorpA-474 ELAN-CorpA_474_install.exe ELAN-CorpA_474_install.zip
ELAN-CorpA-462 ELAN-CorpA_462_win_install.exe ELAN-CorpA_462_win_install.zip
ELAN-CorpA-440 install-ELAN-CorpA2.exe Install-ELAN-CorpA_440.zip
ELAN-CorpA2 install-CorpA2.exe install-CorpA2.zip
ELAN-CorpA-433 ELAN-CorpA_433.exe Install-ELAN-CorpA_433.zip