Tecling: Technologies for Linguistic Analysis
»The World is automatic

Latest ideas & research projects

We are developing new projects in computational linguistics and natural language processing.

+ Taxonomy Induction from corpora. Government funded Project (Conicyt-Fondecyt 11140686)

+ Automatic Extraction of patterns of use of Spanish verbs. Government funded Project (Conicyt-Fondecyt 11140704).

+ Design and implementation of writing aid software for university students. Project funded by the Pontifical Catholic University of Valparaíso

+ There is More.

15 February, 2018: the source code of Neven, the software for automatic detection of eventive nouns, has been published

Neven is a program for the automatic detection of non-deverbal eventive nouns: nouns that designate events but are not derived from verbs, such as fiesta ('party') or cóctel ('cocktail'). These nouns lack the typical morphological features of deverbal nouns, such as the suffixes -ción and -miento, and are therefore more difficult to detect. Publication of the source code had been pending, and it is now available.
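To illustrate why these nouns are hard to detect, here is a minimal sketch of a purely morphological check. It is not Neven's method; the function name and the two extra example nouns (celebración, nacimiento) are assumptions added for illustration. The check finds nouns with the suffixes mentioned above but misses fiesta and cóctel.

```python
# Illustrative only: a naive suffix check, NOT the algorithm used by Neven.
DEVERBAL_SUFFIXES = ("ción", "miento")  # the two suffixes named in the text

def has_deverbal_suffix(noun: str) -> bool:
    """Return True if the noun ends in one of the listed deverbal suffixes."""
    return noun.lower().endswith(DEVERBAL_SUFFIXES)

# All four nouns designate events, but only the first two carry a deverbal suffix.
eventive_nouns = ["celebración", "nacimiento", "fiesta", "cóctel"]
missed = [noun for noun in eventive_nouns if not has_deverbal_suffix(noun)]
print(missed)  # ['fiesta', 'cóctel']
```

A detector that relies on morphology alone would therefore overlook exactly the nouns Neven targets.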

15 February, 2018: we present a new program to detect similarities between documents

We present a new program for detecting textual similarities between documents. The user uploads a zip file with documents in a variety of formats (e.g., doc, docx, pdf, html), and the program returns a ranking of the most similar pairs of documents. Algorithm and implementation by Rogelio Nazar. This project has been supported by the Dsic and the ILCL of the PUCV.
The documentation and an English interface are still pending; for now the interface is available only in Spanish.
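The general idea of ranking document pairs by similarity can be sketched as follows. This is an assumption-laden illustration, not the program's actual algorithm: it uses bag-of-words cosine similarity, takes already extracted plain text as input (zip handling and format conversion are omitted), and the function names and sample documents are hypothetical.

```python
import math
import re
from collections import Counter
from itertools import combinations

def bag_of_words(text):
    """Count lowercase word tokens in a text."""
    return Counter(re.findall(r"\w+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0

def rank_pairs(docs):
    """Return (score, name_1, name_2) tuples, most similar pair first."""
    vectors = {name: bag_of_words(text) for name, text in docs.items()}
    scored = [(cosine(vectors[x], vectors[y]), x, y)
              for x, y in combinations(sorted(vectors), 2)]
    return sorted(scored, reverse=True)

# Hypothetical sample documents, already converted to plain text.
docs = {
    "a.txt": "the cat sat on the mat",
    "b.txt": "the cat sat on the rug",
    "c.txt": "stock prices fell sharply today",
}
ranking = rank_pairs(docs)
for score, first, second in ranking:
    print(f"{first} / {second}: {score:.3f}")
```

In this toy input, a.txt and b.txt share most of their words and come out on top, while pairs involving c.txt score zero.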

25 January, 2018: Termout, the Terminology Extraction System, is now available for Spanish and English

Termout.org is the first implementation of a new method for terminology extraction based on distributional analysis. The intuition behind the algorithm is that single- or multi-word lexical units that refer to specialised concepts will show a characteristic co-occurrence pattern, described as a tendency to appear in the same contexts as other conceptually related terms.
At the moment it works only in Spanish and English.
We have only tested it with psychiatry papers, but you can try it with other domains as well. The system will be slow at first, but it gets faster as it gains experience.
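The co-occurrence intuition can be illustrated with a small sketch. This is not Termout's implementation: the function name, the candidate list, and the sample contexts are assumptions made up for the example, and candidates are matched by simple substring search.

```python
from collections import Counter
from itertools import combinations

def cooccurrence_counts(contexts, candidates):
    """Count how often each pair of candidates appears in the same context."""
    counts = Counter()
    for context in contexts:
        # Simplification: substring matching, so multi-word units work without tokenizing.
        present = sorted(c for c in candidates if c in context)
        for pair in combinations(present, 2):
            counts[pair] += 1
    return counts

# Hypothetical contexts (one sentence each) and candidate units.
contexts = [
    "patients with bipolar disorder often receive lithium",
    "lithium remains a common treatment for bipolar disorder",
    "the study was published last year",
    "results were published after lithium trials",
]
candidates = ["bipolar disorder", "lithium", "published"]

counts = cooccurrence_counts(contexts, candidates)
for pair, n in counts.most_common():
    print(pair, n)
```

Here the two domain terms share more contexts with each other than either does with the general word, which is the kind of pattern the paragraph above describes.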
The web demo is now available at:
More details are also available here:


If you have comments or questions, feel free to contact us.

Tools & demos
We have implemented different types of applications and most of them can be tested online. Take a look.

+ Bifid: a parallel corpus aligner

+ Dsele: a model dictionary for ELE learners

+ Estilector: a tool for assisted writing

+ GeNom: a program to detect the gender of proper nouns

+ Jaguar: a tool for statistical corpus analysis

+ Kind: a taxonomy induction algorithm

+ Kwico: a concordancer for big corpora

+ Neven: a program to detect eventive nouns

+ Termout: a terminology extraction system

+ POL: named entity recognition and classification

+ Poppins: a supervised text classifier

+ Sapo: a program to detect similarities between documents (New!)

+ Verbario: corpus pattern analysis in Spanish

This is the view from our location at the Sausalito lagoon, a quiet and lovely place in Viña del Mar, Chile. Sunny days. Birds can be seen in the center of the lagoon.

As researchers, we are currently affiliated with:
Pontificia Universidad Católica de Valparaíso
Escuela Lingüística de Valparaíso
Instituto de Literatura y Ciencias del Lenguaje

Av. El Bosque 1290, Viña del Mar, Chile

Upcoming Events

April, 2018: A new edition of the Perl Workshop will take place at the Institute for Literary and Linguistic Studies of the Pontificia Universidad Católica de Valparaíso. If you are interested, please contact us.

Recent publications

+ Nazar, R.; Soto, R.; Urrejola, K. (In press). Detección automática de nombres eventivos no deverbales en castellano: un enfoque cuantitativo basado en corpus. Revista Linguamatica.

+ Nazar, R.; Arriagada, P. (2017). POL: un nuevo sistema para la detección y clasificación de nombres propios. Procesamiento del Lenguaje Natural, n. 58, pp. 13-20.

+ Nazar, R., Renau, I. (2016). “A Quantitative analysis of the semantics of verb-argument structures”. In S. Torner and E. Bernal (eds.), Collocations and other lexical combinations in Spanish. Theoretical and Applied approaches. New York: Routledge, pp. 92-109.

+ See more.


Solutions for text processing

We have different methods for text processing depending on the goal. And we like to get in trouble! Check out our solutions for different problems, and get in touch.

+ More.

Contact