Artificial intelligence – Page 283

July 13, 2025

Summary of Varying Shades Of Wrong: Aligning Llms with Wrong Answers Only, by Jihan Yao et al.

Varying Shades of Wrong: Aligning LLMs with Wrong Answers Onlyby Jihan Yao, Wenxuan Ding, Shangbin…

July 13, 2025

Summary of Assessing Bias in Metric Models For Llm Open-ended Generation Bias Benchmarks, by Nathaniel Demchak et al.

Assessing Bias in Metric Models for LLM Open-Ended Generation Bias Benchmarksby Nathaniel Demchak, Xin Guan,…

July 13, 2025

Summary of Practiq: a Practical Conversational Text-to-sql Dataset with Ambiguous and Unanswerable Queries, by Mingwen Dong et al.

PRACTIQ: A Practical Conversational Text-to-SQL dataset with Ambiguous and Unanswerable Queriesby Mingwen Dong, Nischal Ashok…

July 13, 2025

Summary of When Precedents Clash, by Cecilia Di Florio et al.

When Precedents Clashby Cecilia Di Florio, Huimin Dong, Antonino RotoloFirst submitted to arxiv on: 14…

July 13, 2025

Summary of Intelligent Prospector V2.0: Exploration Drill Planning Under Epistemic Model Uncertainty, by John Mern et al.

Intelligent prospector v2.0: exploration drill planning under epistemic model uncertaintyby John Mern, Anthony Corso, Damian…

July 13, 2025

Summary of Multilingual Controlled Generation and Gold-standard-agnostic Evaluation Of Code-mixed Sentences, by Ayushman Gupta et al.

Multilingual Controlled Generation And Gold-Standard-Agnostic Evaluation of Code-Mixed Sentencesby Ayushman Gupta, Akhil Bhogal, Kripabandhu GhoshFirst…

July 13, 2025

Summary of Building a Multivariate Time Series Benchmarking Datasets Inspired by Natural Language Processing (nlp), By Mohammad Asif Ibna Mustafa (department Of Computation et al.

Building a Multivariate Time Series Benchmarking Datasets Inspired by Natural Language Processing (NLP)by Mohammad Asif…

Summary of Varying Shades Of Wrong: Aligning Llms with Wrong Answers Only, by Jihan Yao et al.

Summary of Assessing Bias in Metric Models For Llm Open-ended Generation Bias Benchmarks, by Nathaniel Demchak et al.

Summary of Practiq: a Practical Conversational Text-to-sql Dataset with Ambiguous and Unanswerable Queries, by Mingwen Dong et al.

Summary of When Precedents Clash, by Cecilia Di Florio et al.

Summary of Intelligent Prospector V2.0: Exploration Drill Planning Under Epistemic Model Uncertainty, by John Mern et al.

Summary of Multilingual Controlled Generation and Gold-standard-agnostic Evaluation Of Code-mixed Sentences, by Ayushman Gupta et al.

Summary of Thinking Llms: General Instruction Following with Thought Generation, by Tianhao Wu et al.

Summary of Generative Ai and Its Impact on Personalized Intelligent Tutoring Systems, by Subhankar Maity et al.

Summary of Derail Yourself: Multi-turn Llm Jailbreak Attack Through Self-discovered Clues, by Qibing Ren et al.

Summary of Building a Multivariate Time Series Benchmarking Datasets Inspired by Natural Language Processing (nlp), By Mohammad Asif Ibna Mustafa (department Of Computation et al.