Improving the Domain Adaptation of Retrieval Augmented Generation (RAG) Models for Open Domain Question Answering
Assessing the Capacity of Transformer to Abstract Syntactic Representations: A Contrastive Analysis Based on Long-distance Agreement
On the Role of Negative Precedent in Legal Outcome Prediction
Meta-Learning a Cross-lingual Manifold for Semantic Parsing
OPAL: Ontology-Aware Pretrained Language Model for End-to-End Task-Oriented Dialogue
Helpful Neighbors: Leveraging Neighbors in Geographic Feature Pronunciation
Locally Typical Sampling
Improving Low-Resource Cross-lingual Parsing with Expected Statistic Regularization
Cross-Lingual Dialogue Dataset Creation via Outline-Based Generation
Modeling Emotion Dynamics in Song Lyrics with State Space Models
Word Acquisition in Neural Language Models
Decomposing and Recomposing Event Structure
FeTaQA: Free-form Table Question Answering
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets
CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language Representation
Dealing with Disagreements: Looking Beyond the Majority Vote in Subjective Annotations
Break, Perturb, Build: Automatic Perturbation of Reasoning Paths Through Question Decomposition
Out-of-Domain Discourse Dependency Parsing via Bootstrapping: An Empirical Analysis on Its Effectiveness and Limitation
Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages
SUMMAC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization
A Survey on Automated Fact-Checking
Predicting Document Coverage for Relation Extraction
ABNIRML: Analyzing the Behavior of Neural IR Models
Neuro-symbolic Natural Logic with Introspective Revision for Natural Language Inference
Time-Aware Language Models as Temporal Knowledge Bases
Multilingual Autoregressive Entity Linking
ByT5: Towards a Token-Free Future with Pre-trained Byte-to-Byte Models
Designing an Automatic Agent for Repeated Language-based Persuasion Games
Towards General Natural Language Understanding with Probabilistic Worldbuilding
A Multi-Level Optimization Framework for End-to-End Text Augmentation
Evaluating Explanations: How Much Do Explanations from the Teacher Aid Students?
VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups
Data-driven Model Generalizability in Crosslinguistic Low-resource Morphological Segmentation
PADA: Example-based Prompt Learning for on-the-fly Adaptation to Unseen Domains
LOT: A Story-Centric Benchmark for Evaluating Chinese Long Text Understanding and Generation
Czech Grammar Error Correction with a Large and Diverse Corpus
TopiOCQA: Open-domain Conversational Question Answering with Topic Switching
A Neighborhood Framework for Resource-Lean Content Flagging
Retrieve Fast, Rerank Smart: Cooperative and Joint Approaches for Improved Cross-Modal Retrieval