21 September, 2018: Best paper award at SEPLN 2018

Hernán Robledo and Rogelio Nazar were awarded the prize for the best paper at SEPLN 2018 (Seville, Spain) for their work entitled ``Clasificación automatizada de marcadores discursivos'' (Automatic discourse-marker categorization).


8 September, 2018: New version of Termout.org

Now comes with:

  • a better stoplist
  • a parameter for minimum frequency
  • a control for minimum and maximum size of the text
  • unseen elements are not discarded
  • non-utf8 material is discarded


21 August, 2018: new version in French, English and Spanish of the Taxonomy Project

We have a new web demo of the project. It integrates all the algorithms and at the moment works in French, English and Spanish. The user interface is still pretty rough, but the idea is that one can provide a noun (single nouns only, at the moment) and the program will try to assign the best semantic categories for that noun. It is also possible to provide a list of nouns (one per line), and the program will treat each noun as an independent trial.


12 August, 2018: EMaD: new software for automatic categorization of discourse markers

In the context of the PhD thesis of Hernán Robledo, and coinciding with the publication of our new paper on the subject, we present this new web demo to detect and classify discourse markers. (The program and the documentation are at the moment only in Spanish.)


If you have comments or questions, feel free to contact us.

As researchers, we are currently affiliated with:
Pontificia Universidad Católica de Valparaíso
Instituto de Literatura y Ciencias del Lenguaje

Av. El Bosque 1290, Viña del Mar, Chile

Upcoming Events

16 October, 2018: at 13:00 (GMT) Hernán Robledo will deliver a presentation entitled ``A proposal for the inductive categorisation of discourse markers'' at the Institute for Language, Cognition and Computation of the University of Edinburgh (Scotland). Address: Room 4.31/4.33, Informatics Forum of the University of Edinburgh, 10 Crichton Street, Edinburgh EH8 9AB


Latest ideas & research projects

We are developing new projects in computational linguistics and natural language processing.

+ Ecos-Sud (International Project between Chile and France): "Inducción automática de taxonomías del español y el francés mediante técnicas cuantitativas y estadística de corpus" (Ref. C16H02). Lead researcher: Irene Renau

+ Fondecyt Regular: "Desarrollo de la competencia terminológica a lo largo de la inserción disciplinar" (Ref. 11121597). Lead Researcher: Sabela Fernández. Co-researcher: Rogelio Nazar



Recent publications

+ Irene Renau; Rogelio Nazar; Valesca Lecaros. (Forthcoming). "La evolución de las marcas ortográficas y tipográficas en los procesos de lexicalización de neologismos: un estudio en el vocabulario de la crisis económica en prensa española". Revista Española de Lingüística Aplicada/Spanish Journal of Applied Linguistics.

+ Robledo, H.; Nazar, R. (2018). "Clasificación automatizada de marcadores discursivos", Procesamiento del Lenguaje Natural, n. 61, pp 109-116.

+ Nazar, R. (Forthcoming). "El análisis cuantitativo de la coocurrencia léxica en la lexicografía especializada". Actas del VIII Congreso Internacional de Lexicografía Hispánica. Valencia, España: 27-29 Junio 2018.

+ Nazar, R. (2009 [2018]). Invitación al estudio estadístico del lenguaje. ArXiv:1804.07349 [stat.AP]




