Phil Humnicky/GU
AMTA-2004
The 6th Conference of the Association for Machine Translation in the Americas
Georgetown University, Washington DC
September 28 - October 2, 2004
Home

Call for Papers

Program At-A-Glance

Main Program

Invited Speakers

Accepted Papers

Tutorials

Workshops

Exhibits

Sponsors


Registration

Accommodations

Special Events

Local Information

Washington DC


Organizers

 

Tutorials

The AMTA 2004 Tutorials will be held on September 28th from 9 AM to 12 noon and from 2 to 5PM. The workshop titles follow (with organizers).

Morning Tutorials (9 AM - Noon, 28 September 2004)

Afternoon Tutorials (2 - 5 PM, 28 September 2004)

Tutorial costs for the tutorials are posted on the Registration page.

For any questions, please contact Mike Dillinger mike.dillinger@pobox.com.


Tutorial Abstracts:

Arabic Natural Language Processing

This tutorial will provide MT system developers/researchers with necessary background information for working with Modern Standard Arabic. The goal of the tutorial is to introduce Arabic linguistic phenomena that need to be addressed and review the state-of-the-art on Arabic processing. The focus will be on text-to-text MT from Modern Standard Arabic to English, but other applications with a cross-lingual component will be visited (such as cross-lingual information retrieval). Also some reference will be made to Arabic dialect research.

Dr Nizar Habash received his PhD in 2003 from the Computer Science Department, University of Maryland College Park. He is currently a Postdoctoral researcher at the Center for Computational Learning Systems in Columbia University. His research focuses on Machine Translation, Natural Language Generation and Lexical Semantics. His work on Arabic ranges from research in Arabic encoding issues to Arabic-English Machine Translation and computational modeling of Arabic dialects for Machine Translation and Automatic Speech Recognition.

Interlinguas & Semantic Markup

There has been a resurgence of interest in interlinguas and semantic roles, now focussed on semantic markup using XML, rather than on interlingua-based translation. XML, the Semantic Web, and interlinguas provide increasing amounts of semantic markup within the source text. This trend will have widespread effects on machine translation. In this tutorial, we will review recent efforts to design and use interlinguas and semantic markup, and the new prospects for applying them to translation.

Mike Dillinger is Director of Linguistics for Spoken Translation, Inc., where he supervises the creation of semantic knowledge bases for interactive translation, and consults on localization technologies for clients in industry. He was Director of Linguistic Development for Logos Corporation and for Global Words Technologies where he produced their Portuguese>English system, developed innovative workflow processes, and led the effort to integrate MT with tools for controlled language and translation memory. He also contributed to the Universal Networking Language Project of the United Nations University in Tokyo, where he wrote specifications for the Universal Networking Language interlingua as part of a global effort to implement distributed, interlingua-based machine translation. Previously, Dr. Dillinger did his doctoral research at McGill University in Canada on how interpreters carry out semantic processing during real-time speech-to-speech translation and taught semantics and other disciplines at more than a dozen universities in several countries, and has been a visiting researcher on four continents.

MT in Practice at PAHO

Participants will be active observers in every step in the translation process at the actual setting of PAHO Translation Services: receiving a request, creating an entry in the translation tracking system, MT processing, MT postediting, revision, final check, delivery, feedback and dictionary updating. Target audience: MT users, translators, developers.

Julia Aymerich studied Philology and Applied Linguistics at Universidad de Sevilla, Spain. Came to Washington in 1989 to study Computational Linguistics at Georgetown University. Has been working at PAHO Machine Translation program in 1989, first as a lexicographer, later as computational linguist and currently as senior computational linguist. Responsible for all MT development, user training, and tech support. Is and has been both developer and daily user of MT.

Mastering MT output

This tutorial demonstrates principles on how to master commercial MT systems with MT dictionary building with the goal of improving translation output.
Examples of MT postediting principles are also presented.

Jeff Allen is currently the Program Manager of Product Training and Knowledge Transfer at Mycom France in the Product Division of Mycom International.  He has also served on the MultiLingual Computing and Technology magazine Editorial Advisory Board for nearly four years.  After completing masters and doctoral degrees in general linguistics and computational linguistics at French universities, and teaching at several universities, he then worked on several Natural Language Processing projects/products concerning controlled language, translation systems / software, and speech systems: 1) trainer of technical writing and translation tools on the controlled language authoring and machine translation project at Caterpillar Inc; 2) Research Linguist on wearable speech-to-speech translation systems of the DIPLOMAT project at the Center for Machine Translation (Carnegie Mellon University); 3) Technical Director for several international projects funded by the Evaluation and Language Resources Distribution Agency and the European Commission; and 4) Director of Projects for commercial MT software at Softissimo.

Machine Translation: He has been working on MT dictionary building and MT postediting principles for the past decade. He is the author of the chapter"Postediting" in the book "Computers and Translation: A handbook for Translators" (John Benjamins, 2003) and has published extensively on topics related to MT systems and use.

What's New in Statistical Machine Translation

Accurate translation requires a great deal of knowledge about the usage and meaning of words, the structure of phrases, the meaning of sentences, and which real-life situations are plausible. Recently, there has been a fair amount of research into extracting translation-relevant knowledge automatically from large collections of manually-translated texts, and over the past years, several statistical MT projects have appeared in North America, Europe, and Asia, and the literature is growing substantially. We will overview this progress.

Philipp Koehn is the author of papers on natural language processing, machine translation, and machine learning. He received his PhD from the University of Southern California in 2003, and is currently employed as a postdoc at the Massachusetts Institute of Technology, working with Michael Collins. He has worked at AT&T Laboratories on text-to-speech systems, and at WhizBang! Labs on text categorization.

Kevin Knight is a Senior Research Scientist at the USC/Information Sciences Institute and an Research Associate Professor in the Computer Science Department at USC. He has written a number of articles on statistical MT, plus a widely-circulated MT workbook (http://www.isi.edu/natural-language/mt/wkbk.rtf). Dr. Knight has also given several MT-related talks at the AMTA and EMNLP conferences, such as "Statistical Machine Translation: Where Did It Go?", "Every Time I Fire a Statistician, I Get a Warm Fuzzy Feeling", and "Deeper Representations for Machine Translation: Ready or Not?"

XML and Localization: Optimizing your Content Supply Chain

This tutorial focuses on real-world case studies, showing how Fortune 500 companies have dramatically reduced localization and desktop publishing (DTP) costs with the implementation of XML-based technology. These solutions were designed to address the following problem areas related to localization:

  • inability to determine changes to content between localization cycles
  • requirement to publish localized content to multiple media types (e.g. paper, web)
  • reliance on Translation Memory tools to keep track of previously localized content
  • excessively high DTP and labor cost for publishing "difficult" languages (e.g., Asian languages, right-to-left languages, etc.)

The tutorial will cover the following topics:

  • Brief introduction to XML
  • Discussion of XML-based technologies for content management and single-source publishing
  • Applying XML technology to specific business issues related to localization
  • Business case studies
  • Technology demonstrations of applied XML concepts for multilingual production of content
Dan Dube is Director, Business Development, at Innodata Isogen. Mr. Dube has over 18 years of experience in business process re-engineering and implementation of complex information management systems, including three years as the Founder and President of Lighthouse Solutions Inc., an XML systems integration company. He has led the implementations of approximately seventy XML-based content management and publishing systems around the world in a variety of industries, including telecommunications, legal publishing, aviation, automotive, insurance and software documentation. He designed and managed the implementation of automated localization systems, based on XML components, that are currently used in production at several Global 2000 companies. Mr. Dube is the chief architect of the VisionMapping methodology used by Innodata Isogen consultants for business process analysis engagements. Most recently, he was interviewed in The LISA Newsletter for a special feature on multi-lingual publishing.

amta2004@amtaweb.org
Last updated: Tue, 21 September, 2004 19:01