Freda Shi

Orcid: 0009-0009-5697-449X

Affiliations:
  • University of Waterloo, School of Computer Science, ON, Canada
  • Toyota Technological Institute, Chicago, IL, USA (PhD 2023)
  • Peking University, School of Electronic Engineering and Computer Science, Beijing, China (until 2018)


According to our database1, Freda Shi authored at least 44 papers between 2016 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
How Tokenization Limits Phonological Knowledge Representation in Language Models and How to Improve Them.
CoRR, April, 2026

DriveSOTIF: Advancing SOTIF Through Multimodal Large Language Models.
IEEE Trans. Veh. Technol., March, 2026

Translation or Recitation? Calibrating Evaluation Scores for Machine Translation of Extremely Low-Resource Languages.
CoRR, March, 2026

A Very Big Video Reasoning Suite.
CoRR, February, 2026

From Tokens to Numbers: Continuous Number Modeling for SVG Generation.
CoRR, February, 2026

DriveLegal: Toward legally compliant driving via trustworthy hybrid retrieval-augmented LLMs.
Expert Syst. Appl., 2026

2025
The Mechanistic Emergence of Symbol Grounding in Language Models.
CoRR, October, 2025

Autonomy Matters: A Study on Personalization-Privacy Dilemma in LLM Agents.
CoRR, October, 2025

DriveSOTIF: Advancing Perception SOTIF Through Multimodal Large Language Models.
CoRR, May, 2025

Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference under Ambiguities.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LingGym: How Far Are LLMs from Thinking Like Field Linguists?
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Distribution Prompting: Understanding the Expressivity of Language Models Through the Next-Token Distributions They Can Produce.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

From Behavioral Performance to Internal Competence: Interpreting Vision-Language Models with VLM-Lens.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Logical forms complement probability in understanding language model (and human) performance.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Blessing of Multilinguality: A Systematic Analysis of Multilingual In-Context Learning.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

FORG3D: Flexible Object Rendering for Generating Vision-Language Spatial Reasoning Data from 3D Scenes.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), 2025

SpaRE: Enhancing Spatial Reasoning in Vision-Language Models with Synthetic Data.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Learning Language Structures Through Grounding.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Gated Slot Attention for Efficient Linear-Time Sequence Modeling.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Structured Tree Alignment for Evaluation of (Speech) Constituency Parsing.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

LogogramNLP: Comparing Visual and Textual Representations of Ancient Logographic Writing Systems for NLP.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models.
CoRR, 2023

Large Language Models Can Be Easily Distracted by Irrelevant Context.
Proceedings of the International Conference on Machine Learning, 2023

Language models are multilingual chain-of-thought reasoners.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

InCoder: A Generative Model for Code Infilling and Synthesis.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Audio-Visual Neural Syntax Acquisition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Natural Language to Code Translation with Execution.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Substructure Distribution Projection for Zero-Shot Cross-Lingual Dependency Parsing.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Deep Clustering of Text Representations for Supervision-Free Probing of Syntax.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, 2021

Grammar-Based Grounded Lexicon Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Bilingual Lexicon Induction via Unsupervised Bitext Construction and Word Alignment.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Substructure Substitution: Structured Data Augmentation for NLP.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Clustering Contextualized Representations of Text for Unsupervised Syntax Induction.
CoRR, 2020

A Cross-Task Analysis of Text Span Representations.
Proceedings of the 5th Workshop on Representation Learning for NLP, 2020

On the Role of Supervision in Unsupervised Constituency Parsing.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019
Visually Grounded Neural Syntax Acquisition.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Understanding and Improving Multi-Sense Word Embeddings via Extended Robust Principal Component Analysis.
CoRR, 2018

Implicit Subjective and Sentimental Usages in Multi-sense Word Embeddings.
Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, 2018

Constructing High Quality Sense-specific Corpus and Word Embedding via Unsupervised Elimination of Pseudo Multi-sense.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

On Tree-Based Neural Sentence Modeling.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Learning Visually-Grounded Semantics from Contrastive Adversarial Samples.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

2017
Joint Saliency Estimation and Matching using Image Regions for Geo-Localization of Online Video.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

2016
Real Multi-Sense or Pseudo Multi-Sense: An Approach to Improve Word Representation.
Proceedings of the Workshop on Computational Linguistics for Linguistic Complexity, 2016


  Loading...