Publications

2016

Lin C, Miller T, Dligach D, Bethard S, Savova G. Improving Temporal Relation Extraction with Training Instance Augmentation. In: Proceedings of the 15th Workshop on Biomedical Natural Language Processing. Berlin, Germany: Association for Computational Linguistics; 2016. pp. 108–113.

Miller T, Dligach D, Savova G. Unsupervised Document Classification with Informed Topic Models. In: Proceedings of the 15th Workshop on Biomedical Natural Language Processing. Berlin, Germany: Association for Computational Linguistics; 2016. pp. 83–91.

Shain C, Bryce W, Jin L, Krakovna V, Doshi-Velez F, Miller T, Schuler W, Schwartz L. Memory-Bounded Left-Corner Unsupervised Grammar Induction on Child-Directed Input. In: Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers. Osaka, Japan: The COLING 2016 Organizing Committee; 2016. pp. 964–975.

This paper presents a new memory-bounded left-corner parsing model for unsupervised raw-text syntax induction, using unsupervised hierarchical hidden Markov models (UHHMM). We deploy this algorithm to shed light on the extent to which human language learners can discover hierarchical syntax through distributional statistics alone, by modeling two widely-accepted features of human language acquisition and sentence processing that have not been simultaneously modeled by any existing grammar induction algorithm: (1) a left-corner parsing strategy and (2) limited working memory capacity. To model realistic input to human language learners, we evaluate our system on a corpus of child-directed speech rather than typical newswire corpora. Results beat or closely match those of three competing systems.

2015

Miller T, Bethard S, Dligach D, Lin C, Savova G. Extracting Time Expressions from Clinical Text. In: Proceedings of the 2015 Workshop on Biomedical Natural Language Processing (BioNLP 2015). 2015.

Dligach D, Miller T, Savova GK. Semi-supervised Learning for Phenotyping Tasks. In: AMIA Annual Symposium Proceedings. 2015.

2014

Styler IV W, Bethard S, Finan S, Palmer M, Pradhan SS, Groen P, Erickson B, Miller TA, Lin C, Savova GK, et al. Temporal annotation in the clinical domain. Transactions of the …. 2014;2:143–154.

Wu S, Miller T, Masanz J, Coarr M, Halgrim S, Carrell D, Clark C. Negation’s Not Solved: Generalizability Versus Optimizability in Clinical Natural Language Processing. PLoS ONE. 2014;9(11):e112774. doi:10.1371/journal.pone.0112774

Lin C, Karlson EW, Dligach D, Ramirez MP, Miller T, Mo H, Braggs NS, Cagan, Gainer V, Denny JC, et al. Automatic identification of methotrexate-induced liver toxicity in patients with rheumatoid arthritis from the electronic medical record. Journal of the American Medical Informatics Association. 2014:23–30. doi:10.1136/amiajnl-2014-002642

Lin C, Miller T, Kho A, Bethard S, Dligach D, Pradhan S, Savova G. Descending-Path Convolution Kernel for Syntactic Structures. ACL. 2014;1:81–86.

Convolution tree kernels are an efficient and effective method for comparing syntactic structures in NLP methods. However, current kernel methods such as subset tree kernel and partial tree kernel understate the similarity of very similar tree structures. Although soft-matching approaches can improve the similarity scores, they are corpus-dependent and match relaxations may be task-specific. We propose an alternative approach called descending path kernel which gives intuitive similarity scores on comparable structures. This method is evaluated on two temporal relation extraction tasks and demonstrates its advantage over rich syntactic representations.

2013

Miller T, Bethard S, Dligach D, Pradhan S, Lin C, Savova G. Discovering Temporal Narrative Containers in Clinical Text. In: Proceedings of the 2013 Workshop on Biomedical Natural Language Processing (BioNLP 2013). 2013. pp. 18–26.

Tim Miller

Associate Professor
