Tecling logo   Technologies for Linguistic Analysis
»The World is automatic

January 18, 2019: Sketch Engine Workshop with Vít Baisa

Vít Baisa is an NLP researcher and software developer focusing on exploitations of Sketch Engine for translators and terminologists.
This Workshop will take place in the auditorium of our building (ILCL), in Av. El Bosque 1290, Sausalito, Viña del Mar.



-------------------------------

December 11, 2018: We are conducting sociolinguistic research on the names of clothes in Chile


In the context of Javiera Sarmiento's thesis, we are conducting a survey on the names of clothes in the Spanish variant spoken in different regions in Chile. Javiera is interested to know if there are any differences in how people from different regions (especially Valparaíso and Rancagua) name different items of clothing. The survey is conducted in Spanish, but everybody is welcome to participate:

http://www.tecling.com/cgi-bin/tesis_js/



-------------------------------

December 3, 2018: Detecting syllables in a word in Spanish


In 2003 Ricardo Martínez built, with the help of Scott Sadowsky, an Excel routine to separate words in syllables in Spanish. We now implemented that algorithm as a Perl script, which can be more suitable for batch processing of massive volumes of data. At the moment, however, this online demo only processes one word at a time:

http://www.tecling.com/sicam



-------------------------------

November 15, 2018: First Workshop on Neology Research will take place in one month

On December 14, 2018 will the First Workshop on Neology Research take place (Jornadas de Investigación en Neología). This event is organized by Neovalpo.org Address: Instituto de Literatura y Ciencias del Lenguaje. Pontificia Universidad Católica de Valparaíso. Av. El Bosque 1290, Sausalito, Viña del Mar. Registration: https://goo.gl/nSKFfe



-------------------------------

If you have comments or questions, feel free to contact us.

 
Tools & demos

We have implemented different types of applications and most of them can be tested online. Take a look.

+ Bifid: a parallel corpus aligner

+ Dsele: a model dictionary for ELE learners

+ EMaD: automatic categorization of discouse markers

+ Estilector: a tool for assisted writing

+ GeNom: a program to detect the gender of proper nouns

+ Jaguar: a tool for statistic corpus analysis

+ Kind: a taxonomy induction algorithm

+ Kwico: a concordancer for big corpora

+ Neven: a program to detect eventive nouns

+ Termout: a terminology extraction system

+ POL: named entity recognition and classification

+ Poppins: a supervised text classifier

+ Porcus: an interface for various taggers and parsers for Spanish

+ Sapo: a program to detect similarities between documents

+ Sicam: a Perl implementation of Ricardo Martínez' Excel routine to separate a Spanish Word in syllables (new!)

+ Verbario: corpus pattern analysis in Spanish

 
Sausalito

This is the view from where we are located, in the Sausalito lagoon, a quiet and lovely place in Viña del Mar, Chile. Sunny days. Birds can be seen in the center of the lagoon (click to enlarge).

As researchers, we are currently affiliated to:
Pontificia Universidad Católica de Valparaíso
Instituto de Literatura y Ciencias del Lenguaje

Av. El Bosque 1290, Viña del Mar, Chile

Upcoming Events

January 15, 2019: Five of our students will present their theses tomorrow: Benjamín López, Ana Castro, Javier Obreque, Javiera Sarmiento and Valentina Ravest. Good luck guys.

January 18, 2019: Vít Baisa, from Sketch Engine, is coming to visit us and offer a Workshop on new and old features of the Sketch Engine software.

 
 

Latest ideas & research projects

We are developing new projects in computational linguistics and natural language processing.

+ Ecos-Sud (International Project between Chile and France): "Inducción automática de taxonomías del español y el francés mediante técnicas cuantitativas y estadística de corpus" (Ref. C16H02). Lead researcher: Irene Renau

+ Fondecyt Regular: "Desarrollo de la competencia terminológica a lo largo de la inserción disciplinar" (Ref. 11121597). Lead Researcher: Sabela Fernández. Co-researcher: Rogelio Nazar

+ There is More.

 
Recent publications

+ Irene Renau; Rogelio Nazar; Valesca Lecaros. (Forthcoming). "La evolución de las marcas ortográficas y tipográficas en los procesos de lexicalización de neologismos: un estudio en el vocabulario de la crisis económica en prensa española". Revista Española de Lingüística Aplicada/Spanish Journal of Applied Linguistics.

+ Robledo, H.; Nazar, R. (2018). "Clasificación automatizada de marcadores discursivos", Procesamiento del Lenguaje Natural, n. 61, pp 109-116.

+ Nazar, R. (Forthcoming). "El análisis cuantitativo de la coocurrencia léxica en la lexicografía especializada". Actas del VIII Congreso Internacional de Lexicografía Hispánica. Valencia, España: 27-29 Junio 2018.

+ Nazar, R. (2009 [2018]). Invitación al estudio estadístico del lenguaje. ArXiv:1804.07349 [stat.AP]
(PDF)

+ See more.

 

Solutions for text processing

It is critical for organizations to have the ability to process information automatically, and very often that information is contained in documents to be read by humans rather than machines. We have different methods for text processing depending on the goal.

We can be helpful teaching people how to automatize their text processing routines. We can batch-process thousands of documents to extract information from them or to derive different types of statistics. We can also change these document, or generate databases or email correspondence based on information extracted from them. Anything that involves intelligent management of information can benefit from different degrees of automatization, and by doing that we can free time, effort and resources.

Tell us which are your needs and we will show you what we can do about it.

 
    LogoAlt ABOUT || RESEARCH || SOFTWARE
Av. El Bosque 1290, Viña del Mar, Chile
+56 32 227 4424
Contact