arxiv-cs-cl.bsky.social
Computer Science -- Computation and Language source: export.arxiv.org/rss/cs.CL maintainer: @tmaehara.bsky.social
26,032 posts 1,248 followers 0 following

Gibson Nkhata, Shi Yin Hong, Susan Gauch Sarcasm Detection as a Catalyst: Improving Stance Detection with Cross-Target Capabilities https://arxiv.org/abs/2503.03787

Iñigo Alonso, Ander Salaberria, Gorka Azkune, Jeremy Barnes, Oier Lopez de Lacalle Vision-Language Models Struggle to Align Entities across Modalities https://arxiv.org/abs/2503.03854

Emmy Liu, Amanda Bertsch, Lintang Sutawika, Lindia Tjuatja, Patrick Fernandes, Lara Marinov, Michael Chen, Shreya Singhal, Carolin Lawrence, ... Not-Just-Scaling Laws: Towards a Better Understanding of the Downstream Impact of Language Model Design Decisions https://arxiv.org/abs/2503.03862

Faiz Surani, Mirac Suzgun, Vyoma Raman, Christopher D. Manning, Peter Henderson, Daniel E. Ho AI for Scaling Legal Reform: Mapping and Redacting Racial Covenants in Santa Clara County https://arxiv.org/abs/2503.03888

Sabur Butt, Hector G. Ceballos, Diana P. Madera Tec-Habilidad: Skill Classification for Bridging Education and Employment https://arxiv.org/abs/2503.03932

In Hak Moon Performance Comparison of Large Language Models on Advanced Calculus Problems https://arxiv.org/abs/2503.03960

Catherine Arnett, Tyler A. Chang, James A. Michaelov, Benjamin K. Bergen On the Acquisition of Shared Grammatical Representations in Bilingual Language Models https://arxiv.org/abs/2503.03962

Zongqian Li, Ehsan Shareghi, Nigel Collier ReasonGraph: Visualisation of Reasoning Paths https://arxiv.org/abs/2503.03979

Jiyue Jiang, Pengan Chen, Jiuming Wang, Dongchen He, Ziqin Wei, Liang Hong, Licheng Zong, Sheng Wang, Qinze Yu, Zixian Ma, Yanyu Chen, Yimin Fan, Xiangyu Shi, ... Benchmarking Large Language Models on Multiple Tasks in Bioinformatics NLP with Prompting https://arxiv.org/abs/2503.04013

Chenglong Wang, Haoyu Tang, Xiyuan Yang, Yueqi Xie, Jina Suh, Sunayana Sitaram, Junming Huang, Yu Xie, Zhaoya Gong, Xing Xie, Fangzhao Wu Uncovering inequalities in new knowledge learning by large language models across different languages https://arxiv.org/abs/2503.04064

Xiangnan Chen, Yuancheng Fang, Qian Xiao, Juncheng Li, Jun Lin, Siliang Tang, Yi Yang, Yueting Zhuang Chart-HQA: A Benchmark for Hypothetical Question Answering in Charts https://arxiv.org/abs/2503.04095

Runtao Zhou, Guangya Wan, Saadia Gabriel, Sheng Li, Alexander J Gates, Maarten Sap, Thomas Hartvigsen Disparities in LLM Reasoning Accuracy and Explanations: A Case Study on African American English https://arxiv.org/abs/2503.04099

Zichong Li, Xinyu Feng, Yuheng Cai, Zixuan Zhang, Tianyi Liu, Chen Liang, Weizhu Chen, Haoyu Wang, Tuo Zhao LLMs Can Generate a Better Answer by Aggregating Their Own Responses https://arxiv.org/abs/2503.04104

Erik Jones, Arjun Patrawala, Jacob Steinhardt Uncovering Gaps in How Humans and LLMs Interpret Subjective Language https://arxiv.org/abs/2503.04113

Jiyue Jiang, Zikang Wang, Yuheng Shan, Heyan Chai, Jiayi Li, Zixian Ma, Xinrui Zhang, Yu Li Biological Sequence with Language Model Prompting: A Survey https://arxiv.org/abs/2503.04135

Xue Han, Qian Hu, Yitong Wang, Wenchun Gao, Lianlian Zhang, Qing Wang, Lijun Mei, Chao Deng, Junlan Feng Ticktack : Long Span Temporal Alignment of Large Language Models Leveraging Sexagenary Cycle Time Expression https://arxiv.org/abs/2503.04150

Chi Hang, Ruiqi Deng, Lavender Yao Jiang, Zihao Yang, Anton Alyakin, Daniel Alber, Eric Karl Oermann BPQA Dataset: Evaluating How Well Language Models Leverage Blood Pressures to Answer Biomedical Questions https://arxiv.org/abs/2503.04155

R. Patrick Xian, Qiming Cui, Stefan Bauer, Reza Abbasi-Asl Measuring temporal effects of agent knowledge by date-controlled tool use https://arxiv.org/abs/2503.04188

Bin Chen, Yu Zhang, Hongfei Ye, Ziyi Huang, Hongyang Chen Knowledge-Decoupled Synergetic Learning: An MLLM based Collaborative Approach to Few-shot Multimodal Dialogue Intention Recognition https://arxiv.org/abs/2503.04201

Ziyi Yang, Fanqi Wan, Longguang Zhong, Canbin Huang, Guosheng Liang, Xiaojun Quan FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion https://arxiv.org/abs/2503.04222

Jie He, Bo Peng, Yi Liao, Qun Liu, Deyi Xiong Tgea: An error-annotated dataset and benchmark tasks for text generation from pretrained language models https://arxiv.org/abs/2503.04232

Ruizhe Chen, Wenhao Chai, Zhifei Yang, Xiaotian Zhang, Joey Tianyi Zhou, Tony Quek, Soujanya Poria, Zuozhu Liu DiffPO: Diffusion-styled Preference Optimization for Efficient Inference-Time Alignment of Large Language Models https://arxiv.org/abs/2503.04240

Yana van de Sande, Gunes Açar, Thabo van Woudenberg, Martha Larson On Fact and Frequency: LLM Responses to Misinformation Expressed with Uncertainty https://arxiv.org/abs/2503.04271

Muhammad Amien Ibrahim, Faisal, Tora Sangputra Yopie Winarto, Zefanya Delvin Sulistiya Dual-Class Prompt Generation: Enhancing Indonesian Gender-Based Hate Speech Detection through Data Augmentation https://arxiv.org/abs/2503.04279

Dilek Küçük, Fazli Can Computational Law: Datasets, Benchmarks, and Ontologies https://arxiv.org/abs/2503.04305

Tadej Škvorc, Marko Robnik-Šikonja Solving Word-Sense Disambiguation and Word-Sense Induction with Dictionary Examples https://arxiv.org/abs/2503.04328

Wenhong Zhu, Weinan Zhang, Rui Wang Adding Alignment Control to Language Models https://arxiv.org/abs/2503.04346

Zhenghua Wang, Yiran Ding, Changze Lv, Zhibo Xu, Tianlong Li, Tianyuan Shi, Xiaoqing Zheng, Xuanjing Huang Layer-Specific Scaling of Positional Encodings for Superior Long-Context Modeling https://arxiv.org/abs/2503.04355

Jiayi Chang, Mingqi Gao, Xinyu Hu, Xiaojun Wan Exploring the Multilingual NLG Evaluation Abilities of LLM-Based Evaluators https://arxiv.org/abs/2503.04360

Yafu Li, Ronghao Zhang, Zhilin Wang, Huajian Zhang, Leyang Cui, Yongjing Yin, Tong Xiao, Yue Zhang Lost in Literalism: How Supervised Training Shapes Translationese in LLMs https://arxiv.org/abs/2503.04369

Orfeas Menis Mastromichalakis, Giorgos Filandrianos, Maria Symeonaki, Giorgos Stamou Assumed Identities: Quantifying Gender Bias in Machine Translation of Ambiguous Occupational Terms https://arxiv.org/abs/2503.04372

Zhilin Wang, Jiaqi Zeng, Olivier Delalleau, Daniel Egert, Ellie Evans, Hoo-Chang Shin, Felipe Soares, Yi Dong, Oleksii Kuchaiev Dedicated Feedback and Edit Models Empower Inference-Time Scaling for Open-Ended General-Domain Tasks https://arxiv.org/abs/2503.04378

Cheng-Han Chiang, Hung-yi Lee, Michal Lukasik TRACT: Regression-Aware Fine-tuning Meets Chain-of-Thought Reasoning for LLM-as-a-Judge https://arxiv.org/abs/2503.04381

Shahar Levy, Nir Mazor, Lihi Shalmon, Michael Hassid, Gabriel Stanovsky More Documents, Same Length: Isolating the Challenge of Multiple Documents in RAG https://arxiv.org/abs/2503.04388

Tom Kouwenhoven, Max Peeperkorn, Roy de Kleijn, Tessa Verhoef Shaping Shared Languages: Human and Large Language Models' Inductive Biases in Emergent Communication https://arxiv.org/abs/2503.04395

Xinyi He, Yihao Liu, Mengyu Zhou, Yeye He, Haoyu Dong, Shi Han, Zejian Yuan, Dongmei Zhang TableLoRA: Low-rank Adaptation on Table Structure Understanding for Large Language Models https://arxiv.org/abs/2503.04396

Sanjib Narzary, Bihung Brahma, Haradip Mahilary, Mahananda Brahma, Bidisha Som, Sukumar Nandi Comparative Study of Zero-Shot Cross-Lingual Transfer for Bodo POS and NER Tagging Using Gemini 2.0 Flash Thinking Experimental Model https://arxiv.org/abs/2503.04405

Hyunwoo Yoo Can Large Language Models Predict Antimicrobial Resistance Gene? https://arxiv.org/abs/2503.04413

Yifei Yuan, Anders Søgaard Revisiting the Othello World Model Hypothesis https://arxiv.org/abs/2503.04421

Owen Cook, Yida Mu, Xinye Yang, Xingyi Song, Kalina Bontcheva A Dataset for Analysing News Framing in Chinese Media https://arxiv.org/abs/2503.04439

Michał Dolina, Jakub Dec, Stanisław Drożdż, Jarosław Kwapień, Jin Liu, Tomasz Stanisz Quantifying patterns of punctuation in modern Chinese prose https://arxiv.org/abs/2503.04449

Van Bach Nguyen, Christin Seifert, Jörg Schlötterer Guiding LLMs to Generate High-Fidelity and High-Quality Counterfactual Explanations for Text Classification https://arxiv.org/abs/2503.04463

Dimitri von Rütte, Janis Fluri, Yuhui Ding, Antonio Orvieto, Bernhard Schölkopf, Thomas Hofmann Generalized Interpolating Discrete Diffusion https://arxiv.org/abs/2503.04482

Zhenyu Wang, Zikang Wang, Jiyue Jiang, Pengan Chen, Xiangyu Shi, Yu Li Large Language Models in Bioinformatics: A Survey https://arxiv.org/abs/2503.04490

Wenke Huang, Jian Liang, Xianda Guo, Yiyang Fang, Guancheng Wan, Xuankun Rong, Chi Wen, Zekun Shi, Qingyun Li, Didi Zhu, Yanbiao Ma, Ke Liang, Bin Yang, He Li, ... Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model https://arxiv.org/abs/2503.04543

Zhipeng Chen, Yingqian Min, Beichen Zhang, Jie Chen, Jinhao Jiang, Daixuan Cheng, Wayne Xin Zhao, Zheng Liu, Xu Miao, Yang Lu, Lei Fang, Zhongyuan Wang, Ji-Rong Wen An Empirical Study on Eliciting and Improving R1-like Reasoning Models https://arxiv.org/abs/2503.04548

Armel Zebaze, Benoît Sagot, Rachel Bawden Compositional Translation: A Novel LLM-based Approach for Low-resource Machine Translation https://arxiv.org/abs/2503.04554

Jacqueline R. M. A. Maasch, Alihan Hüyük, Xinnuo Xu, Aditya V. Nori, Javier Gonzalez Compositional Causal Reasoning Evaluation in Language Models https://arxiv.org/abs/2503.04556

Zhijian Zhuo, Yutao Zeng, Ya Wang, Sijun Zhang, Jian Yang, Xiaoqing Li, Xun Zhou, Jinwen Ma HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization https://arxiv.org/abs/2503.04598

Mohammad Amin Ghanizadeh, Mohammad Javad Dousti Towards Data-Efficient Language Models: A Child-Inspired Approach to Language Learning https://arxiv.org/abs/2503.04611