Second Celtic Language Technology Workshop

(CLTW 2016)

JEP-TALN, Paris, 4 July 2016

http://www.lattice.cnrs.fr/CLTW/

Co-located with JEP-TALN, 4 July 2016, Paris, France

https://jep-taln2016.limsi.fr

New Submission Deadline: April 20, 2016

English version

Context

Language Tchnology and Computational Linguistics research innovations in recent years have given us a great deal of modern language processing tools and resources for many languages. Basic language tools like spell and grammar checkers through to interactive systems like Siri, as well as resources like the Trillion Word Corpus, all fit together to produce products and services which enhance our daily lives.

Until relatively recently, languages with smaller numbers of speakers have largely not benefited from attention in this field. However, modern techniques in the field are making it easier to create language tools and resources from fewer resources in a faster time. In this light, many lesser spoken languages are making their way into the digital age through the provision of language technologies and resources.

The Celtic Language Technology Workshop (CLTW) series of workshops provides a forum for researchers interested in developing NLP (Natural Language Processing) resources and technologies for Celtic languages. As Celtic languages are under-resourced, our goal is to encourage collaboration and communication between researchers working on language technologies and resources for Celtic languages.

This will be the second Celtic Language Technology Workshop (CLTW). The first one was held in Dublin during COLING 2014. We are pleased to organize this second event in France, during the JEP-TALN conference (https://jep-taln2016.limsi.fr).

Programme

9h15-9h30

Introduction

9h30-10 Monica Ward CALLIPSO – CALL for Irish for Parents Students and Others
10h-10h30

Teresa Lynn and Jennifer Foster – Universal Dependencies for Irish

10h30-11h

Coffee break

11h-11h30 Colin Batchelor – Automatic derivation of categorial grammar from a part-of-speech-tagged corpus in Scottish Gaelic
11h30-11h50 Mícheál Mac Lochlainn – Celtic Language Mark-up in WordPress
11h50-12h10 Oksana Dereza – Building a Dictionary-Based Lemmatizer for Old Irish
12h10-12h30

Annie Foret – Enrichissement de données en breton avec Wordnet

12h30-14h

Lunch

14h-14h50 Invited speaker: Robin Owain
14h50-15h10 Katie Ní Loingsigh – Towards a lexicon of Irish-language idioms
15h10-15h30 Dewi Bryn Jones, Gruffudd Prys and Delyth Prys – Vocab: a dictionary plugin for websites
15h30-16h

Meghan Dowling, Teresa Lynn, Yvette Graham and John Judge – English to Irish Machine Translation with Automatic Post-Editing

16h-16h30

Coffee break

16h30-17h00

William Lamb and Mark Sinclair – Developing Word Embedding Models for Scottish Gaelic

17-17h15 Wrap up

Areas of Interest

The CLTW welcomes both theoretical and practical submissions on any Celtic language that contribute to research in the area of automated language processing, language technologies or resources for same. We will particularly encourage studies that address the lack of resources available for a given language in this field.

Topics of interest for the CLTW workshop include but are not limited to:

Important Dates

New Paper submission deadline: April 20, 2016

Author notification deadline: May 10, 2016

Camera-ready paper deadline: May 27, 2016

How to Submit

Authors are invited to submit long papers (up to 12 pages + references) and short papers (up to 6 pages + references). Long papers should describe unpublished, substantial and completed original research. Short papers should be position papers, papers describing original work in progress or short, focused contributions.

Submissions will be accepted until April 15, 2016 (11:59 p.m. BST -- Brest Standard Time) in PDF format via the Easychair system (https://easychair.org/conferences/?conf=taln2016, select the "Celtic Language Technology Workshop" track) and must be formatted using the TALN style files.

The style files are available at (https://jep-taln2016.limsi.fr/styles/jeptaln2016-v2.zip) and papers must be submitted in PDF format.

Reviewing will be double-blind.

Language Policy

Submissions can be in French or English. English is only allowed if one of the co-authors does not speak French. Both French and English will be used as working languages during the workshop. Other Celtic languages are also welcome as working languages.

Programme committee

Workshop Organizers

Further Information

Further information is available on the workshop website at http://www.lattice.cnrs.fr/CLTW/ or by emailing the workshop organizers.