Paper List
We recommend you use the search box as this list is very long.
-
Summary of Handwritten Code Recognition For Pen-and-paper Cs Education, by Md Sazzad Islam et al.
-
Summary of Longitudinal Evaluation Of Child Face Recognition and the Impact Of Underlying Age, by Surendra Singh et al.
-
Summary of Grif-dm: Generation Of Rich Impression Fonts Using Diffusion Models, by Lei Kang et al.
-
Summary of On Effects Of Steering Latent Representation For Large Language Model Unlearning, by Dang Huu-tien et al.
-
Summary of Strategy Game-playing with Size-constrained State Abstraction, by Linjie Xu et al.
-
Summary of Fleurs-r: a Restored Multilingual Speech Corpus For Generation Tasks, by Min Ma and Yuma Koizumi and Shigeki Karita and Heiga Zen and Jason Riesa and Haruko Ishikawa and Michiel Bacchiani
-
Summary of Moviesum: An Abstractive Summarization Dataset For Movie Screenplays, by Rohit Saxena et al.
-
Summary of Owl2vec4oa: Tailoring Knowledge Graph Embeddings For Ontology Alignment, by Sevinj Teymurova et al.
-
Summary of Visualagentbench: Towards Large Multimodal Models As Visual Foundation Agents, by Xiao Liu et al.
-
Summary of Algorithm Research Of Elmo Word Embedding and Deep Learning Multimodal Transformer in Image Description, by Xiaohan Cheng et al.
-
Summary of Evaluating Llms on Entity Disambiguation in Tables, by Federico Belotti and Fabio Dadda and Marco Cremaschi and Roberto Avogadro and Matteo Palmonari
-
Summary of Towards Autonomous Agents: Adaptive-planning, Reasoning, and Acting in Language Models, by Abhishek Dutta and Yen-che Hsiao
-
Summary of Rethinking the Alignment Of Psychotherapy Dialogue Generation with Motivational Interviewing Strategies, by Xin Sun et al.
-
Summary of Cross-lingual Conversational Speech Summarization with Large Language Models, by Max Nelson et al.
-
Summary of Benchmarking Tree Species Classification From Proximally-sensed Laser Scanning Data: Introducing the For-species20k Dataset, by Stefano Puliti et al.
-
Summary of Hdrgs: High Dynamic Range Gaussian Splatting, by Jiahao Wu et al.
-
Summary of Aquilamoe: Efficient Training For Moe Models with Scale-up and Scale-out Strategies, by Bo-wen Zhang et al.
-
Summary of Social Debiasing For Fair Multi-modal Llms, by Harry Cheng et al.
-
Summary of A Perspective on Large Language Models, Intelligent Machines, and Knowledge Acquisition, by Vladimir Cherkassky and Eng Hock Lee
-
Summary of Amuro and Char: Analyzing the Relationship Between Pre-training and Fine-tuning Of Large Language Models, by Kaiser Sun et al.
-
Summary of Dc3do: Diffusion Classifier For 3d Objects, by Nursena Koprucu et al.
-
Summary of Simple but Effective Compound Geometric Operations For Temporal Knowledge Graph Completion, by Rui Ying and Mengting Hu and Jianfeng Wu and Yalan Xie and Xiaoyi Liu and Zhunheng Wang and Ming Jiang and Hang Gao and Linlin Zhang and Renhong Cheng
-
Summary of Enhancing Visual Dialog State Tracking Through Iterative Object-entity Alignment in Multi-round Conversations, by Wei Pang and Ruixue Duan and Jinfu Yang and Ning Li
-
Summary of Separate Generation and Evaluation For Parallel Greedy Best-first Search, by Takumi Shimoda and Alex Fukunaga
-
Summary of Deformable Image Registration with Multi-scale Feature Fusion From Shared Encoder, Auxiliary and Pyramid Decoders, by Hongchao Zhou and Shunbo Hu
-
Summary of Top Pass: Improve Code Generation by Pass@k-maximized Code Ranking, By Zhi-cun Lyu et al.
-
Summary of Reference-free Hallucination Detection For Large Vision-language Models, by Qing Li et al.
-
Summary of An Analysis Of Hoi: Using a Training-free Method with Multimodal Visual Foundation Models When Only the Test Set Is Available, Without the Training Set, by Chaoyi Ai
-
Summary of Neurosymbolic Methods For Rule Mining, by Agnieszka Lawrynowicz et al.
-
Summary of Seg-cyclegan : Sar-to-optical Image Translation Guided by a Downstream Task, By Hannuo Zhang et al.
-
Summary of Hatesieve: a Contrastive Learning Framework For Detecting and Segmenting Hateful Content in Multimodal Memes, by Xuanyu Su et al.
-
Summary of Real-time Drowsiness Detection Using Eye Aspect Ratio and Facial Landmark Detection, by Varun Shiva Krishna Rupani et al.
-
Summary of Robust Domain Generalization For Multi-modal Object Recognition, by Yuxin Qiao et al.
-
Summary of Open Role-playing with Delta-engines, by Hongqiu Wu et al.
-
Summary of The Cognitive Revolution in Interpretability: From Explaining Behavior to Interpreting Representations and Algorithms, by Adam Davies et al.
-
Summary of Weakly Supervised Video Anomaly Detection and Localization with Spatio-temporal Prompts, by Peng Wu et al.
-
Summary of Spb3dtracker: a Robust Lidar-based Person Tracker For Noisy Environment, by Eunsoo Im et al.
-
Summary of A New Pipeline For Generating Instruction Dataset Via Rag and Self Fine-tuning, by Chih-wei Song et al.
-
Summary of Freehand Sketch Generation From Mechanical Components, by Zhichao Liao et al.
-
Summary of An Investigation Into Explainable Audio Hate Speech Detection, by Jinmyeong An et al.
-
Summary of Exploring and Learning Structure: Active Inference Approach in Navigational Agents, by Daria De Tinguy and Tim Verbelen and Bart Dhoedt
-
Summary of Online Optimization Of Curriculum Learning Schedules Using Evolutionary Optimization, by Mohit Jiwatode et al.
-
Summary of Dynamic Blocked Clause Elimination For Projected Model Counting, by Jean-marie Lagniez et al.
-
Summary of From Text to Insight: Leveraging Large Language Models For Performance Evaluation in Management, by Ning Li et al.
-
Summary of Revisiting Multi-modal Llm Evaluation, by Jian Lu et al.
-
Summary of Vacode: Visual Augmented Contrastive Decoding, by Sihyeon Kim et al.
-
Summary of Car: Contrast-agnostic Deformable Medical Image Registration with Contrast-invariant Latent Regularization, by Yinsong Wang et al.
-
Summary of Fistech: Financial Style Transfer to Enhance Creativity Without Hallucinations in Llms, by Sohini Roychowdhury et al.
-
Summary of Shield: Llm-driven Schema Induction For Predictive Analytics in Ev Battery Supply Chain Disruptions, by Zhi-qi Cheng et al.
-
Summary of Style-preserving Lip Sync Via Audio-aware Style Reference, by Weizhi Zhong et al.
-
Summary of High-fidelity and Lip-synced Talking Face Synthesis Via Landmark-based Diffusion Model, by Weizhi Zhong et al.
-
Summary of Epam-net: An Efficient Pose-driven Attention-guided Multimodal Network For Video Action Recognition, by Ahmed Abdelkawy et al.
-
Summary of Investigating Instruction Tuning Large Language Models on Graphs, by Kerui Zhu et al.
-
Summary of Multi-agent Planning Using Visual Language Models, by Michele Brienza et al.
-
Summary of Structure and Reduction Of Mcts For Explainable-ai, by Ronit Bustin and Claudia V. Goldman
-
Summary of Disentangled Noisy Correspondence Learning, by Zhuohang Dang et al.
-
Summary of Multi-layer Sequence Labeling-based Joint Biomedical Event Extraction, by Gongchi Chen et al.
-
Summary of Document-level Event Extraction with Definition-driven Icl, by Zhuoyuan Liu et al.
-
Summary of In-context Exploiter For Extensive-form Games, by Shuxin Li et al.
-
Summary of Metacognitive Myopia in Large Language Models, by Florian Scholten et al.
-
Summary of Urfound: Towards Universal Retinal Foundation Models Via Knowledge-guided Masked Modeling, by Kai Yu et al.
-
Summary of Prtgaussian: Efficient Relighting Using 3d Gaussians with Precomputed Radiance Transfer, by Libo Zhang et al.
-
Summary of Stealthdiffusion: Towards Evading Diffusion Forensic Detection Through Diffusion Model, by Ziyin Zhou et al.
-
Summary of Ai For Operational Methane Emitter Monitoring From Space, by Anna Vaughan et al.
-
Summary of Data-driven Pixel Control: Challenges and Prospects, by Saurabh Farkya et al.
-
Summary of Auggs: Self-augmented Gaussians with Structural Masks For Sparse-view 3d Reconstruction, by Bi’an Du et al.
-
Summary of Glitchprober: Advancing Effective Detection and Mitigation Of Glitch Tokens in Large Language Models, by Zhibo Zhang et al.
-
Summary of Ensemble Bert: a Student Social Network Text Sentiment Classification Model Based on Ensemble Learning and Bert Architecture, by Kai Jiang et al.
-
Summary of Towards a Generative Approach For Emotion Detection and Reasoning, by Ankita Bhaumik et al.
-
Summary of Unleashing Artificial Cognition: Integrating Multiple Ai Systems, by Muntasir Adnan et al.
-
Summary of Avoid Wasted Annotation Costs in Open-set Active Learning with Pre-trained Vision-language Model, by Jaehyuk Heo et al.
-
Summary of Improving Mortality Prediction After Radiotherapy with Large Language Model Structuring Of Large-scale Unstructured Electronic Health Records, by Sangjoon Park et al.
-
Summary of Llava-vsd: Large Language-and-vision Assistant For Visual Spatial Description, by Yizhang Jin et al.
-
Summary of Profuser: Progressive Fusion Of Large Language Models, by Tianyuan Shi et al.
-
Summary of Order Matters in Hallucination: Reasoning Order As Benchmark and Reflexive Prompting For Large-language-models, by Zikai Xie
-
Summary of Generating Novel Experimental Hypotheses From Language Models: a Case Study on Cross-dative Generalization, by Kanishka Misra et al.
-
Summary of Mooer: Llm-based Speech Recognition and Translation Models From Moore Threads, by Junhao Xu et al.
-
Summary of Kif: Knowledge Identification and Fusion For Language Model Continual Learning, by Yujie Feng et al.
-
Summary of Vita: Towards Open-source Interactive Omni Multimodal Llm, by Chaoyou Fu et al.
-
Summary of Large Language Model Based Agent Framework For Electric Vehicle Charging Behavior Simulation, by Junkang Feng et al.
-
Summary of Large Model Strategic Thinking, Small Model Efficiency: Transferring Theory Of Mind in Large Language Models, by Nunzio Lore et al.
-
Summary of A Recurrent Yolov8-based Framework For Event-based Object Detection, by Diego A. Silva et al.
-
Summary of Chain Of Stance: Stance Detection with Large Language Models, by Junxia Ma et al.
-
Summary of Plugh: a Benchmark For Spatial Understanding and Reasoning in Large Language Models, by Alexey Tikhonov
-
Summary of Evaluating the Impact Of Advanced Llm Techniques on Ai-lecture Tutors For a Robotics Course, by Sebastian Kahl et al.
-
Summary of Knowledge Ai: Fine-tuning Nlp Models For Facilitating Scientific Knowledge Extraction and Understanding, by Balaji Muralidharan et al.
-
Summary of Batching Bpe Tokenization Merges, by Alexander P. Morgan
-
Summary of Xmainframe: a Large Language Model For Mainframe Modernization, by Anh T. V. Dau et al.
-
Summary of Strong and Weak Alignment Of Large Language Models with Human Values, by Mehdi Khamassi et al.
-
Summary of Citekit: a Modular Toolkit For Large Language Model Citation Generation, by Jiajun Shen et al.
-
Summary of Dopamin: Transformer-based Comment Classifiers Through Domain Post-training and Multi-level Layer Aggregation, by Nam Le Hai and Nghi D. Q. Bui
-
Summary of Mitigating Hallucinations in Large Vision-language Models (lvlms) Via Language-contrastive Decoding (lcd), by Avshalom Manevich et al.
-
Summary of Llm-based Mofs Synthesis Condition Extraction Using Few-shot Demonstrations, by Lei Shi et al.
-
Summary of Llms Are Not Just Next Token Predictors, by Stephen M. Downes et al.
-
Summary of Forecasting Live Chat Intent From Browsing History, by Se-eun Yoon et al.
-
Summary of Prompt and Prejudice, by Lorenzo Berlincioni et al.
-
Summary of Crest: Effectively Compacting a Datastore For Retrieval-based Speculative Decoding, by Sophia Ho et al.
-
Summary of Acl Ready: Rag Based Assistant For the Acl Checklist, by Michael Galarnyk et al.
-
Summary of Conversational Ai Powered by Large Language Models Amplifies False Memories in Witness Interviews, By Samantha Chan et al.