Publications
2025
- EMNLPV-SEAM: Visual Semantic Editing and Attention Modulating for Causal Interpretability of Vision-Language Models In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025)
- EMNLPSciEvent: Benchmarking Multi-domain Scientific Event Extraction In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025)
- arXiv
- ICLRNo Preference Left Behind: Group Distributional Preference Optimization The Thirteenth International Conference on Learning Representations (ICLR 2025) [Abs]
2024
- EMNLPBenchmarking Machine Translation with Cultural Awareness In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP Findings 2024) [Abs]
- COLMHow Well Do LLMs Identify Cultural Unity in Diversity? In First Conference on Language Modeling (COLM 2024) [Abs]
- arXiv
- NAACLCPopQA: Ranking cultural concept popularity by LLMs In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024) [Abs]
- CVPRLookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024) [Abs]
2023
- Big DataPHAED: A speaker-aware parallel hierarchical attentive encoder-decoder model for multi-turn dialogue generation IEEE Transactions on Big Data [Abs]
2022
- DissertationThe influence of optical character recognition quality on the robustness of semantic encoding Thesis at University of Illinois Urbana Champaign [Abs]
- JCDLA prototype Gutenberg-Hathitrust sentence-level parallel corpus for OCR error analysis: Pilot investigations In Proceedings of the 22nd ACM/IEEE Joint Conference on Digital Libraries [Abs]
- IJDLEvaluating BERT-based scientific relation classifiers for scholarly knowledge graph construction on digital library collections International Journal on Digital Libraries [Abs]
2021
- CHRImpact of OCR Quality on BERT Embeddings in the Domain Classification of Book Excerpts In Proceedings of the Second Conference on Computational Humanities Research [Abs]
- JCDLEvaluating BERT’s Encoding of Intrinsic Semantic Features of OCR’d Digital Library Collections In Proceedings of the ACM/IEEE Joint Conference on Digital Libraries (Poster)
- iConferenceThe Gutenberg-HathiTrust Pallel Corpus: A Real-World Dataset for Noise Investigation in Uncorrected OCR Texts iConference 2021 (Poster) [Abs]
2020
- ICADLImproving Scholarly Knowledge Representation: Evaluating BERT-based Models for Scientific Relation Classification Runner-up of Best Student Paper Award In Proceedings of the 22nd International Conference on Asia-Pacific Digital Libraries (ICADL 2020)
- ASIS&TTargeting precision: A Hybrid Scientific Relation Extraction Pipeline for Improved Scholarly Knowledge Organization In Proceedings of the 83rd Annual Meeting of the Association for Informatin Science and Technology (ASIS&T 2020)
- JCDLImproving Digital Libraries’ Provision of Digital Humanties Datasets: A Case Study of HTRC Literature Dataset In 2020 ACM/IEEE Joint Conference on Digital Libraries (JCDL 2020) [Abs]
2019
- EMNLPTIGEr: Text-to-Image Grounding for Image Caption Evaluation In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019) [Abs] [Code]
- EMNLPREO-Relevance, Extraness, Omission: A Fine-grained Evaluation for Image Captioning In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019) [Abs]
- TextGraphsA Constituency Parsing Tree based Method for Relation Extraction from Abstracts of Scholarly Publications In Proceedings of the Thirteenth Workshop on Graph-Based Methods for Natural Language Processing at EMNLP-IJCNLP 2019 [Abs]
2018
- SUNBELTReliable Construction of Semantic Networks based on Text Data and Measurement of Effects in Text-based Networks International Network for Social Network Analysis (SUNBELT 2018)
2016
- COLINGSays Who...? Identification of Expert versus Layman Critics’ Reviews of Documentary Films In Proceedings of the 26th International Conference on Computational Linguistics: Technical Papers (COLING 2016) [Abs]
- HypertextIssue-focused Documentaries Versus Other Films: Rating and Type Prediction based on User-authored Reviews In Proceedings of the 27th ACM Conference on Hypertext and Social Media (Hypertext 2016) [Abs]
- iConference
2013
- Geomatica
- SOCAA Novel Information Search and Recommendation Services Platform based on an Indexing Network In 2013 IEEE 6th International Conference on Service-Oriented Computing and Applications (SOCA 2013) [Abs]
- TREC