Judith Gelernter

Multi-lingual Geoparsing based on Machine Translation

Nov 06, 2015
Xu Chen, Han Zhang, Judith Gelernter

Our method for multi-lingual geoparsing uses monolingual tools and resources along with machine translation and alignment to return location words in many languages. Not only does our method save the time and cost of developing geoparsers for each language separately, but it also allows a wide range of language capabilities within a single interface. We evaluated our method in our LanguageBridge prototype on location named entities using newswire, broadcast news and telephone conversations in English, Arabic and Chinese data from the Linguistic Data Consortium (LDC). Our results for geoparsing Chinese and Arabic text using our multi-lingual geoparsing method are comparable to our results for geoparsing English text with our English tools. Furthermore, experiments using our machine translation approach result in accuracy comparable to results from the same data translated manually.

* 7 pages, 4 figures
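
The abstract above combines machine translation, word alignment and a monolingual geoparser. As a rough illustration only, the sketch below assumes one possible direction: the source text has already been translated into English, an English geoparser has flagged location tokens by index, and a word alignment is available as (source index, English index) pairs. The function name `project_locations` and all of its inputs are hypothetical and are not taken from the LanguageBridge prototype.

```python
def project_locations(source_tokens, english_location_indices, alignment):
    """Map location tokens found in an English translation back to the source.

    source_tokens: tokens of the original (non-English) sentence.
    english_location_indices: indices of English tokens tagged as locations
        by some English geoparser (assumed to exist; not implemented here).
    alignment: iterable of (source_index, english_index) pairs produced by
        some word aligner (assumed input; not implemented here).
    """
    location_set = set(english_location_indices)
    # Keep every source index aligned to at least one English location token.
    source_hits = sorted({s for s, e in alignment if e in location_set})
    return [source_tokens[i] for i in source_hits]


# Hypothetical example: Chinese sentence, its English translation
# "I live in Beijing", and a one-to-one word alignment.
source = ["我", "住", "在", "北京"]
english_locations = [3]  # index of "Beijing" in the English translation
alignment = [(0, 0), (1, 1), (2, 2), (3, 3)]
print(project_locations(source, english_locations, alignment))
```

Real alignments are often many-to-many and noisy, so a practical system would likely need extra filtering; that is outside the scope of this sketch.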

Recognizing Extended Spatiotemporal Expressions by Actively Trained Average Perceptron Ensembles

Aug 19, 2015
Wei Zhang, Yang Yu, Osho Gupta, Judith Gelernter

Precise geocoding and time normalization for text require that location and time phrases be identified. Many state-of-the-art geoparsers and temporal parsers suffer from low recall. Categories commonly missed by parsers are: nouns used in a non-spatiotemporal sense, adjectival and adverbial phrases, prepositional phrases, and numerical phrases. We collected and annotated a data set by querying a commercial web search API with spatiotemporal expressions that were missed by state-of-the-art parsers. Due to the high cost of sentence annotation, active learning was used to label training data, and a new strategy was designed to better select training examples to reduce labeling cost. For the learning algorithm, we applied an average-perceptron-trained Featurized Hidden Markov Model (FHMM). Five FHMM instances were used to create an ensemble, with the output phrase selected by voting. Our ensemble model was tested on a range of sequential labeling tasks and showed competitive performance. Our contributions include (1) a new dataset annotated with named entities and expanded spatiotemporal expressions; (2) a comparison of inference algorithms for ensemble models showing the superior accuracy of Belief Propagation over Viterbi Decoding; (3) a new example re-weighting method for active ensemble learning that 'memorizes' the latest examples trained; (4) a spatiotemporal parser that jointly recognizes expanded spatiotemporal expressions as well as named entities.

* 10 pages
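
The abstract above says the ensemble's output is selected by voting among five FHMM instances. A minimal sketch of one plausible reading, a per-token majority vote over the label sequences produced by each ensemble member, is given below. The token-level granularity, the tie-breaking rule (the label seen first among the members wins) and the name `vote_labels` are assumptions for illustration; the paper's actual voting scheme and its Belief Propagation inference are not reproduced here.

```python
from collections import Counter


def vote_labels(predictions):
    """Combine label sequences from several taggers by per-token majority vote.

    predictions: list of label sequences, one per ensemble member, all of the
        same length. Ties go to the label that appears first among members
        (an assumed rule, relying on Counter's insertion ordering).
    """
    if not predictions:
        return []
    length = len(predictions[0])
    if any(len(p) != length for p in predictions):
        raise ValueError("all label sequences must have the same length")
    voted = []
    # zip(*predictions) yields, for each token, the labels from every member.
    for labels_for_token in zip(*predictions):
        voted.append(Counter(labels_for_token).most_common(1)[0][0])
    return voted


# Hypothetical outputs of five ensemble members for a three-token sentence.
member_outputs = [
    ["O", "B-LOC", "O"],
    ["O", "B-LOC", "B-TIME"],
    ["O", "O", "B-TIME"],
    ["O", "B-LOC", "B-TIME"],
    ["B-LOC", "B-LOC", "O"],
]
print(vote_labels(member_outputs))  # ['O', 'B-LOC', 'B-TIME']
```

Voting per token can produce label sequences that no single member proposed, which is one reason a phrase-level vote might be preferred in practice.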

Exploring Metaphorical Senses and Word Representations for Identifying Metonyms

Aug 19, 2015
Wei Zhang, Judith Gelernter

A metonym is a word with a figurative meaning, similar to a metaphor. Because metonyms are closely related to metaphors, we apply features that have been used successfully for metaphor recognition to the task of detecting metonyms. On the ACL SemEval 2007 Task 8 data with gold standard metonym annotations, our system achieved 86.45% accuracy on location metonyms. Our code can be found on GitHub.

* 9 pages, 8 pages of content
