Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.CL

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Computation and Language

Authors and titles for recent submissions

  • Tue, 2 Dec 2025
  • Mon, 1 Dec 2025
  • Thu, 27 Nov 2025
  • Wed, 26 Nov 2025
  • Tue, 25 Nov 2025

See today's new changes

Total of 476 entries : 1-50 101-150 151-200 201-250 235-284 251-300 301-350 351-400 ... 451-476
Showing up to 50 entries per page: fewer | more | all

Mon, 1 Dec 2025 (continued, showing last 8 of 125 entries )

[235] arXiv:2511.22499 (cross-list from cs.CV) [pdf, html, other]
Title: What Shape Is Optimal for Masks in Text Removal?
Hyakka Nakada, Marika Kubota
Comments: 12 pages, 17 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[236] arXiv:2511.22367 (cross-list from cs.LG) [pdf, html, other]
Title: SuRe: Surprise-Driven Prioritised Replay for Continual LLM Learning
Hugo Hazard, Zafeirios Fountas, Martin A. Benfeghoul, Adnan Oomerjee, Jun Wang, Haitham Bou-Ammar
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[237] arXiv:2511.22333 (cross-list from cs.DC) [pdf, html, other]
Title: PAT: Accelerating LLM Decoding via Prefix-Aware Attention with Resource Efficient Multi-Tile Kernel
Jinjun Yi, Zhixin Zhao, Yitao Hu, Ke Yan, Weiwei Sun, Hao Wang, Laiping Zhao, Yuhao Zhang, Wenxin Li, Keqiu Li
Comments: Accepted by ASPLOS'26
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Computation and Language (cs.CL)
[238] arXiv:2511.22311 (cross-list from cs.AI) [pdf, html, other]
Title: Swarms of Large Language Model Agents for Protein Sequence Design with Experimental Validation
Fiona Y. Wang, Di Sheng Lee, David L. Kaplan, Markus J. Buehler
Subjects: Artificial Intelligence (cs.AI); Mesoscale and Nanoscale Physics (cond-mat.mes-hall); Soft Condensed Matter (cond-mat.soft); Computation and Language (cs.CL); Machine Learning (cs.LG)
[239] arXiv:2511.22232 (cross-list from cs.CV) [pdf, html, other]
Title: From Compound Figures to Composite Understanding: Developing a Multi-Modal LLM from Biomedical Literature with Medical Multiple-Image Benchmarking and Validation
Zhen Chen, Yihang Fu, Gabriel Madera, Mauro Giuffre, Serina Applebaum, Hyunjae Kim, Hua Xu, Qingyu Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[240] arXiv:2511.22150 (cross-list from cs.LG) [pdf, html, other]
Title: From Topology to Retrieval: Decoding Embedding Spaces with Unified Signatures
Florian Rottach, William Rudman, Bastian Rieck, Harrisen Scells, Carsten Eickhoff
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[241] arXiv:2511.21757 (cross-list from cs.CY) [pdf, html, other]
Title: Medical Malice: A Dataset for Context-Aware Safety in Healthcare LLMs
Andrew Maranhão Ventura D'addario
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[242] arXiv:2511.21750 (cross-list from cs.CV) [pdf, html, other]
Title: SO-Bench: A Structural Output Evaluation of Multimodal LLMs
Di Feng, Kaixin Ma, Feng Nan, Haofeng Chen, Bohan Zhai, David Griffiths, Mingfei Gao, Zhe Gan, Eshan Verma, Yinfei Yang, Zhifeng Chen, Afshin Dehghan
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Robotics (cs.RO)

Thu, 27 Nov 2025 (showing first 42 of 68 entries )

[243] arXiv:2511.21692 [pdf, html, other]
Title: Revisiting Generalization Across Difficulty Levels: It's Not So Easy
Yeganeh Kordi, Nihal V. Nayak, Max Zuo, Ilana Nguyen, Stephen H. Bach
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[244] arXiv:2511.21689 [pdf, html, other]
Title: ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration
Hongjin Su, Shizhe Diao, Ximing Lu, Mingjie Liu, Jiacheng Xu, Xin Dong, Yonggan Fu, Peter Belcak, Hanrong Ye, Hongxu Yin, Yi Dong, Evelina Bakhturina, Tao Yu, Yejin Choi, Jan Kautz, Pavlo Molchanov
Comments: 21 pages, 6 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[245] arXiv:2511.21686 [pdf, html, other]
Title: Matrix: Peer-to-Peer Multi-Agent Synthetic Data Generation Framework
Dong Wang, Yang Li, Ansong Ni, Ching-Feng Yeh, Youssef Emad, Xinjie Lei, Liam Robbins, Karthik Padthe, Hu Xu, Xian Li, Asli Celikyilmaz, Ramya Raghavendra, Lifei Huang, Carole-Jean Wu, Shang-Wen Li
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[246] arXiv:2511.21629 [pdf, html, other]
Title: The author is dead, but what if they never lived? A reception experiment on Czech AI- and human-authored poetry
Anna Marklová, Ondřej Vinš, Martina Vokáčová, Jiří Milička
Subjects: Computation and Language (cs.CL)
[247] arXiv:2511.21613 [pdf, html, other]
Title: Beyond URLs: Metadata Diversity and Position for Efficient LLM Pretraining
Dongyang Fan, Diba Hashemi, Sai Praneeth Karimireddy, Martin Jaggi
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[248] arXiv:2511.21610 [pdf, html, other]
Title: Auxiliary Metrics Help Decoding Skill Neurons in the Wild
Yixiu Zhao, Xiaozhi Wang, Zijun Yao, Lei Hou, Juanzi Li
Comments: 7 pages, 7 figures. Includes additional appendix
Subjects: Computation and Language (cs.CL)
[249] arXiv:2511.21568 [pdf, html, other]
Title: RoParQ: Paraphrase-Aware Alignment of Large Language Models Towards Robustness to Paraphrased Questions
Minjoon Choi
Comments: 12 pages, 9 figures, 8 tables
Subjects: Computation and Language (cs.CL)
[250] arXiv:2511.21533 [pdf, other]
Title: Bangla Sign Language Translation: Dataset Creation Challenges, Benchmarking and Prospects
Husne Ara Rubaiyeat, Hasan Mahmud, Md Kamrul Hasan
Comments: 14 pages, 8 tables
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[251] arXiv:2511.21517 [pdf, html, other]
Title: Voice, Bias, and Coreference: An Interpretability Study of Gender in Speech Translation
Lina Conti, Dennis Fucci, Marco Gaido, Matteo Negri, Guillaume Wisniewski, Luisa Bentivogli
Comments: Submitted to LREC 2026
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[252] arXiv:2511.21473 [pdf, html, other]
Title: Hierarchical Ranking Neural Network for Long Document Readability Assessment
Yurui Zheng, Yijun Chen, Shaohong Zhang
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[253] arXiv:2511.21437 [pdf, html, other]
Title: A Systematic Study of Model Merging Techniques in Large Language Models
Oğuz Kağan Hitit, Leander Girrbach, Zeynep Akata
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[254] arXiv:2511.21416 [pdf, html, other]
Title: Odin: Oriented Dual-module Integration for Text-rich Network Representation Learning
Kaifeng Hong, Yinglong Zhang, Xiaoying Hong, Xuewen Xia, Xing Xu
Comments: 32 pages, 2 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[255] arXiv:2511.21402 [pdf, html, other]
Title: Text-to-SQL as Dual-State Reasoning: Integrating Adaptive Context and Progressive Generation
Zhifeng Hao, Qibin Song, Ruichu Cai, Boyan Xu
Subjects: Computation and Language (cs.CL)
[256] arXiv:2511.21401 [pdf, html, other]
Title: Can LLMs extract human-like fine-grained evidence for evidence-based fact-checking?
Antonín Jarolím, Martin Fajčík, Lucia Makaiová
Subjects: Computation and Language (cs.CL)
[257] arXiv:2511.21399 [pdf, html, other]
Title: Training Introspective Behavior: Fine-Tuning Induces Reliable Internal State Detection in a 7B Model
Joshua Fonseca Rivera
Comments: 16 pages, 8 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[258] arXiv:2511.21334 [pdf, html, other]
Title: Emergent Lexical Semantics in Neural Language Models: Testing Martin's Law on LLM-Generated Text
Kai Kugler
Comments: paper draft
Subjects: Computation and Language (cs.CL)
[259] arXiv:2511.21285 [pdf, html, other]
Title: PEFT-Bench: A Parameter-Efficient Fine-Tuning Methods Benchmark
Robert Belanec, Branislav Pecher, Ivan Srba, Maria Bielikova
Subjects: Computation and Language (cs.CL)
[260] arXiv:2511.21229 [pdf, other]
Title: Developing an Open Conversational Speech Corpus for the Isan Language
Adisai Na-Thalang, Chanakan Wittayasakpan, Kritsadha Phatcharoen, Supakit Buakaw
Comments: 31 pages, in Thai language, 3 figures, 25 tables
Subjects: Computation and Language (cs.CL)
[261] arXiv:2511.21218 [pdf, html, other]
Title: Can Finetuing LLMs on Small Human Samples Increase Heterogeneity, Alignment, and Belief-Action Coherence?
Steven Wang, Kyle Hunt, Shaojie Tang, Kenneth Joseph
Subjects: Computation and Language (cs.CL)
[262] arXiv:2511.21214 [pdf, html, other]
Title: Self-Guided Defense: Adaptive Safety Alignment for Reasoning Models via Synthesized Guidelines
Yuhang Wang, Yanxu Zhu, Dongyuan Lu, Jitao Sang
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[263] arXiv:2511.21101 [pdf, html, other]
Title: MortgageLLM: Domain-Adaptive Pretraining with Residual Instruction Transfer, Alignment Tuning, and Task-Specific Routing
Manish Jain, Satheesh Kumar Ponnambalam, Salman Faroz, Chandrakanth Lns, Vinay Sharma
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[264] arXiv:2511.21088 [pdf, html, other]
Title: ASR Error Correction in Low-Resource Burmese with Alignment-Enhanced Transformers using Phonetic Features
Ye Bhone Lin, Thura Aung, Ye Kyaw Thu, Thazin Myint Oo
Comments: 7 pages, 2 figures, 7 tables, Accepted to iSAI-NLP 2025
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[265] arXiv:2511.21086 [pdf, html, other]
Title: Orthographic Constraint Satisfaction and Human Difficulty Alignment in Large Language Models
Bryan E. Tuck, Rakesh M. Verma
Subjects: Computation and Language (cs.CL)
[266] arXiv:2511.21081 [pdf, html, other]
Title: Enhancing Burmese News Classification with Kolmogorov-Arnold Network Head Fine-tuning
Thura Aung, Eaint Kay Khaing Kyaw, Ye Kyaw Thu, Thazin Myint Oo, Thepchai Supnithi
Comments: 6 pages, 2 figures, 4 tables, Accepted to iSAI-NLP 2025
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[267] arXiv:2511.21066 [pdf, html, other]
Title: Context-Aware Pragmatic Metacognitive Prompting for Sarcasm Detection
Michael Iskandardinata, William Christian, Derwin Suhartono
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[268] arXiv:2511.21038 [pdf, html, other]
Title: Semantic Anchors in In-Context Learning: Why Small LLMs Cannot Flip Their Labels
Anantha Padmanaban Krishna Kumar (Boston University)
Comments: 13 pages total (7 pages main text, 3 pages references, 3 pages appendix), 2 figures, 14 tables. Code available at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[269] arXiv:2511.21006 [pdf, html, other]
Title: TrackList: Tracing Back Query Linguistic Diversity for Head and Tail Knowledge in Open Large Language Models
Ioana Buhnila, Aman Sinha, Mathieu Constant
Comments: under review
Subjects: Computation and Language (cs.CL)
[270] arXiv:2511.20940 [pdf, html, other]
Title: Chatty-KG: A Multi-Agent AI System for On-Demand Conversational Question Answering over Knowledge Graphs
Reham Omar, Abdelghny Orogat, Ibrahim Abdelaziz, Omij Mangukiya, Panos Kalnis, Essam Mansour
Comments: This paper is accepted to SIGMOD 2026
Subjects: Computation and Language (cs.CL)
[271] arXiv:2511.20910 [pdf, html, other]
Title: Emergence and Localisation of Semantic Role Circuits in LLMs
Nura Aljaafari, Danilo S. Carvalho, André Freitas
Subjects: Computation and Language (cs.CL)
[272] arXiv:2511.20872 [pdf, other]
Title: Winning with Less for Low Resource Languages: Advantage of Cross-Lingual English_Persian Argument Mining Model over LLM Augmentation
Ali Jahan, Masood Ghayoomi, Annette Hautli-Janisz
Comments: Preprint. Under review
Subjects: Computation and Language (cs.CL)
[273] arXiv:2511.20857 [pdf, html, other]
Title: Evo-Memory: Benchmarking LLM Agent Test-time Learning with Self-Evolving Memory
Tianxin Wei, Noveen Sachdeva, Benjamin Coleman, Zhankui He, Yuanchen Bei, Xuying Ning, Mengting Ai, Yunzhe Li, Jingrui He, Ed H. Chi, Chi Wang, Shuo Chen, Fernando Pereira, Wang-Cheng Kang, Derek Zhiyuan Cheng
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[274] arXiv:2511.20849 [pdf, html, other]
Title: Length-MAX Tokenizer for Language Models
Dong Dong, Weijie Su
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[275] arXiv:2511.20836 [pdf, html, other]
Title: Structured Prompting Enables More Robust Evaluation of Language Models
Asad Aali, Muhammad Ahmed Mohsin, Vasiliki Bikia, Arnav Singhvi, Richard Gaus, Suhana Bedi, Hejie Cui, Miguel Fuentes, Alyssa Unell, Yifan Mai, Jordan Cahoon, Michael Pfeffer, Roxana Daneshjou, Sanmi Koyejo, Emily Alsentzer, Christopher Potts, Nigam H. Shah, Akshay S. Chaudhari
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[276] arXiv:2511.20820 [pdf, html, other]
Title: SAGE: An Agentic Explainer Framework for Interpreting SAE Features in Language Models
Jiaojiao Han, Wujiang Xu, Mingyu Jin, Mengnan Du
Subjects: Computation and Language (cs.CL)
[277] arXiv:2511.20799 [pdf, html, other]
Title: Memories Retrieved from Many Paths: A Multi-Prefix Framework for Robust Detection of Training Data Leakage in Large Language Models
Trung Cuong Dang, David Mohaisen
Comments: 11 pages, 2 tables, 8 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[278] arXiv:2511.20691 [pdf, other]
Title: LLMs-Powered Accurate Extraction, Querying and Intelligent Management of Literature derived 2D Materials Data
Lijun Shang, Yadong Yu, Wenqiang Kang, Jian Zhou, Dongyue Gao, Pan Xiang, Zhe Liu, Mengyan Dai, Zhonglu Guo, Zhimei Sun
Comments: 100 pages (18 pages main text, 82 pages supplementary material), 5 figures. Supplementary material starts from page 19
Subjects: Computation and Language (cs.CL); Materials Science (cond-mat.mtrl-sci); Databases (cs.DB)
[279] arXiv:2511.20683 [pdf, html, other]
Title: Dynamic Template Selection for Output Token Generation Optimization: MLP-Based and Transformer Approaches
Bharadwaj Yadavalli
Comments: 20 pages, 4 figures, includes production-scale experiments across OpenAI GPT-4, Google Gemini, and Anthropic Claude; code available upon request
Subjects: Computation and Language (cs.CL)
[280] arXiv:2511.20680 [pdf, other]
Title: Cognitive bias in LLM reasoning compromises interpretation of clinical oncology notes
Matthew W. Kenaston (1), Umair Ayub (1), Mihir Parmar (2), Muhammad Umair Anjum (1), Syed Arsalan Ahmed Naqvi (1), Priya Kumar (1), Samarth Rawal (1), Aadel A. Chaudhuri (4), Yousef Zakharia (3), Elizabeth I. Heath (5), Tanios S. Bekaii-Saab (3), Cui Tao (6), Eliezer M. Van Allen (7), Ben Zhou (2), YooJung Choi (2), Chitta Baral (2), Irbaz Bin Riaz (1 and 3 and 6) ((1) Mayo Clinic College of Medicine and Science, Phoenix, AZ, (2) School of Computing and AI, Arizona State University, Tempe, AZ, (3) Mayo Clinic Comprehensive Cancer Center, Phoenix, AZ, (4) Department of Radiation Oncology, Mayo Clinic, Rochester, MN, (5) Department of Oncology, Mayo Clinic, Rochester, MN, (6) Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN, (7) Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA)
Comments: 24 pages, 6 figures, 1 supplementary figure, 3 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[281] arXiv:2511.20677 [pdf, html, other]
Title: Prompt Engineering Techniques for Context-dependent Text-to-SQL in Arabic
Saleh Almohaimeed, May Alsofyani, Saad Almohaimeed, Mansour Al Ghanim, Liqiang Wang
Comments: Accepted at IJCNN 2025 (to appear in IEEE/IJCNN proceedings). This arXiv submission corresponds to the camera-ready version
Subjects: Computation and Language (cs.CL); Databases (cs.DB)
[282] arXiv:2511.20673 [pdf, html, other]
Title: Semantics Meet Signals: Dual Codebook Representationl Learning for Generative Recommendation
Zheng Hui, Xiaokai Wei, Reza Shirkavand, Chen Wang, Weizhi Zhang, Alejandro Peláez, Michelle Gong
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[283] arXiv:2511.20672 [pdf, other]
Title: MindSET: Advancing Mental Health Benchmarking through Large-Scale Social Media Data
Saad Mankarious, Ayah Zirikly, Daniel Wiechmann, Elma Kerz, Edward Kempa, Yu Qiao
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[284] arXiv:2511.20669 [pdf, html, other]
Title: Structured Definitions and Segmentations for Legal Reasoning in LLMs: A Study on Indian Legal Data
Mann Khatri, Mirza Yusuf, Rajiv Ratn Shah, Ponnurangam Kumaraguru
Comments: Accepted at BDA 2025 as short paper; This paper is long version
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Total of 476 entries : 1-50 101-150 151-200 201-250 235-284 251-300 301-350 351-400 ... 451-476
Showing up to 50 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status