Jaguar: A Tool for Quantitative Corpus Analysis

Jaguar is a tool for corpus exploitation. It can analyze textual corpora supplied by the user or collected from the web, and it is currently available both as a web application and as a Perl module. The functions currently available are vocabulary analysis of corpora, concordance extraction, n-gram sorting, and measures of association, distribution and similarity.
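To make these functions concrete, here is a minimal sketch of what a vocabulary list, a concordance, an n-gram ranking and one association measure compute. It is written in Python for illustration only and is not Jaguar's own code or API: every function name is hypothetical, the tokenizer is deliberately naive, and pointwise mutual information is just one of many possible association measures, chosen here as an assumption.

```python
import math
import re
from collections import Counter

def tokenize(text):
    """Lowercase the text and split it into word tokens (naive tokenizer)."""
    return re.findall(r"\w+", text.lower())

def ngrams(tokens, n):
    """Return all n-grams (as tuples) in the order they occur."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def concordance(tokens, keyword, width=2):
    """Keyword-in-context lines: up to `width` tokens on each side of every hit."""
    lines = []
    for i, tok in enumerate(tokens):
        if tok == keyword:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append(f"{left} [{tok}] {right}".strip())
    return lines

def pmi(tokens, w1, w2):
    """Pointwise mutual information (in bits) of the bigram (w1, w2)."""
    unigrams = Counter(tokens)
    bigrams = Counter(ngrams(tokens, 2))
    n_tokens = len(tokens)
    n_bigrams = max(n_tokens - 1, 1)
    p_xy = bigrams[(w1, w2)] / n_bigrams
    if p_xy == 0:
        return float("-inf")  # the pair never occurs together
    p_x = unigrams[w1] / n_tokens
    p_y = unigrams[w2] / n_tokens
    return math.log2(p_xy / (p_x * p_y))

text = "the cat sat on the mat and the cat slept"
tokens = tokenize(text)
vocabulary = Counter(tokens)                             # word-frequency list
top_bigrams = Counter(ngrams(tokens, 2)).most_common(3)  # n-grams sorted by frequency
kwic = concordance(tokens, "cat", width=1)               # concordance lines for "cat"
score = pmi(tokens, "the", "cat")                        # association of "the" + "cat"
```

On a real corpus the same steps would run over many documents, and a production tool would need a proper tokenizer and frequency thresholds before ranking by an association score.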

Jaguar is essentially a Perl module instantiated as a web application. A web application has the advantage of running on any platform without an installation procedure. With the module, however, users can build their own sequences of procedures, taking the output of one process as the input of another. The web interface is limited to executing one procedure at a time, so the output of a process has to be fed manually as input to the next one.
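The idea of chaining procedures can be sketched as follows. This is a generic illustration in Python, not the Jaguar module's actual interface: the step functions and the `pipeline` helper are hypothetical names introduced only to show how one process's output becomes the next one's input.

```python
from collections import Counter
from functools import reduce

def tokenize(text):
    """Step 1: turn raw text into a list of lowercase tokens."""
    return text.lower().split()

def bigrams(tokens):
    """Step 2: turn a token list into a list of adjacent word pairs."""
    return list(zip(tokens, tokens[1:]))

def rank(items):
    """Step 3: count the items and sort them by descending frequency."""
    return Counter(items).most_common()

def pipeline(*steps):
    """Compose steps left to right: each step receives the previous step's output."""
    return lambda data: reduce(lambda acc, step: step(acc), steps, data)

# Build the sequence once, then run it on any input text.
analyze = pipeline(tokenize, bigrams, rank)
result = analyze("to be or not to be")
```

In a web interface, each of these three steps would be a separate request, with the user copying the intermediate result into the next form by hand.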

Since July 2016, this project has been funded by the "Technological Prototypes" track of the Innovation and Entrepreneurship 2016 Program of Pontificia Universidad Católica de Valparaíso.

The project is a full renovation and extension of the old "Jaguar Project", carried out at Universitat Pompeu Fabra in Barcelona from 2006 to 2012. The title of the current project is "Jaguar: an open-source prototype for quantitative corpus analysis".

The results of this project will be officially presented on January 11, 2017, at 5 pm, at the university headquarters at Av. Brasil #2950, Valparaíso, Chile.

We are also planning to offer an introductory workshop on the use of this tool in the summer of 2017, in Valparaíso, in Santiago, or possibly in both. Drop us a line if you are interested.

Related publications:

+ Nazar, R.; Vivaldi, J.; Cabré, MT. (2008). A Suite to Compile and Analyze an LSP Corpus. Proceedings of LREC 2008 (The 6th edition of the Language Resources and Evaluation Conference) Marrakech (Morocco), May 28-30, 2008.

A paper describing the new version of the program is currently in preparation.