AI Research Intelligence Brief - October 17th, 2025 - Academic Research

# Academic Research Intelligence

Deep dive into AI research papers for researchers and academics

---

Executive Summary

1. ScholarBench: A Bilingual Benchmark for Abstraction, Comprehension, and Reasoning Evaluation in Academic Contexts

Introduces ScholarBench, a bilingual benchmark for evaluating LLMs on complex academic problem-solving. It targets specialized academic contexts to assess abstraction, comprehension, and reasoning, addressing the limited ability of prior benchmarks to scale to deep expert knowledge.

📑 Quick Navigation

Executive Summary

1. ScholarBench: A Bilingual Benchmark for Abstraction, Comprehension, and Reasoning Evaluation in Academic Contexts

2. Are LLMs Stable Formal Logic Translators in Logical Reasoning Across Linguistically Diversified Texts?

3. EasyNER: A Customizable Easy-to-Use Pipeline for Deep Learning- and Dictionary-based Named Entity Recognition from Medical and Life Science Text

4. CAP: Evaluation of Persuasive and Creative Image Generation

5. Vgent: Graph-based Retrieval-Reasoning-Augmented Generation For Long Video Understanding

6. Capture, Canonicalize, Splat: Zero-Shot 3D Gaussian Avatars from Unstructured Phone Images

7. PIA: Deepfake Detection Using Phoneme-Temporal and Identity-Dynamic Analysis

8. CLEAR: Causal Learning Framework For Robust Histopathology Tumor Detection Under Out-Of-Distribution Shifts

9. Vision-Centric Activation and Coordination for Multimodal Large Language Models

10. DOS: Directional Object Separation in Text Embeddings for Multi-Object Image Generation

11. Pruning Overparameterized Multi-Task Networks for Degraded Web Image Restoration

12. PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

13. Towards Generalist Intelligence in Dentistry: Vision Foundation Models for Oral and Maxillofacial Radiology

14. Acquisition of interpretable domain information during brain MR image harmonization for content-based image retrieval

15. Consistent text-to-image generation via scene de-contextualization

16. Eyes Wide Open: Ego Proactive Video-LLM for Streaming Video

17. Efficient Video Sampling: Pruning Temporally Redundant Tokens for Faster VLM Inference

18. Adapting Self-Supervised Representations as a Latent Space for Efficient Generation

19. SteeringTTA: Guiding Diffusion Trajectories for Robust Test-Time-Adaptation

20. WeCKD: Weakly-supervised Chained Distillation Network for Efficient Multimodal Medical Imaging

AI for Science

1. CLEAR: Causal Learning Framework For Robust Histopathology Tumor Detection Under Out-Of-Distribution Shifts

2. EasyNER: A Customizable Easy-to-Use Pipeline for Deep Learning- and Dictionary-based Named Entity Recognition from Medical and Life Science Text

3. Element2Vec: Build Chemical Element Representation from Text for Property Prediction

4. Biology-informed neural networks learn nonlinear representations from omics data to improve genomic prediction and interpretability

5. Improving Intrusion Detection with Domain-Invariant Representation Learning in Latent Space

6. Towards geological inference with process-based and deep generative modeling, part 1: training on fluvial deposits

AI Safety & Ethics

1. NAPPure: Adversarial Purification for Robust Image Classification under Non-Additive Perturbations

2. Are LLMs Stable Formal Logic Translators in Logical Reasoning Across Linguistically Diversified Texts?

3. PIA: Deepfake Detection Using Phoneme-Temporal and Identity-Dynamic Analysis

4. Vision-Centric Activation and Coordination for Multimodal Large Language Models

5. CLEAR: Causal Learning Framework For Robust Histopathology Tumor Detection Under Out-Of-Distribution Shifts

6. LOTA: Bit-Planes Guided AI-Generated Image Detection

7. Structured Universal Adversarial Attacks on Object Detection for Video Sequences

8. RAID: Refusal-Aware and Integrated Decoding for Jailbreaking LLMs

AI Theory & Foundations

1. Are LLMs Stable Formal Logic Translators in Logical Reasoning Across Linguistically Diversified Texts?

2. Multilinguality Does not Make Sense: Investigating Factors Behind Zero-Shot Transfer in Sense-Aware Tasks

3. TextBandit: Evaluating Probabilistic Reasoning in LLMs Through Language-Only Decision Tasks

4. Interpreting the Latent Structure of Operator Precedence in Language Models

5. LLM Prompt Duel Optimizer: Efficient Label-Free Prompt Optimization

6. RAID: Refusal-Aware and Integrated Decoding for Jailbreaking LLMs

Computer Vision

1. CAP: Evaluation of Persuasive and Creative Image Generation

2. Vgent: Graph-based Retrieval-Reasoning-Augmented Generation For Long Video Understanding

3. Capture, Canonicalize, Splat: Zero-Shot 3D Gaussian Avatars from Unstructured Phone Images

4. MatchAttention: Matching the Relative Positions for High-Resolution Cross-View Matching

5. GauSSmart: Enhanced 3D Reconstruction through 2D Foundation Models and Geometric Filtering

6. A Multi-domain Image Translative Diffusion StyleGAN for Iris Presentation Attack Detection

7. Vision-Centric Activation and Coordination for Multimodal Large Language Models

8. DOS: Directional Object Separation in Text Embeddings for Multi-Object Image Generation

9. NAPPure: Adversarial Purification for Robust Image Classification under Non-Additive Perturbations

10. TopoStreamer: Temporal Lane Segment Topology Reasoning in Autonomous Driving

Efficient AI

1. Real-Time Neural Video Compression with Unified Intra and Inter Coding

2. Low Power Vision Transformer Accelerator with Hardware-Aware Pruning and Optimized Dataflow

3. BitNet Distillation

4. ELASTIC: Efficient Once For All Iterative Search for Object Detection on Microcontrollers

5. Efficient Dynamic Structured Sparse Training with Learned Shuffles

6. Enhancing Time-Series Anomaly Detection by Integrating Spectral-Residual Bottom-Up Attention with Reservoir Computing

Generative AI

1. DOS: Directional Object Separation in Text Embeddings for Multi-Object Image Generation

2. On the Ability of LLMs to Handle Character-Level Perturbations: How Well and How?

3. Visual Stereotypes of Autism Spectrum in Janus-Pro-7B, DALL-E, Stable Diffusion, SDXL, FLUX, and Midjourney

4. Contrastive Diffusion Alignment: Learning Structured Latents for Controllable Generation

5. A Multi-domain Image Translative Diffusion StyleGAN for Iris Presentation Attack Detection

6. CAP: Evaluation of Persuasive and Creative Image Generation

7. Real-Time Adaptive Motion Planning via Point Cloud-Guided, Energy-Based Diffusion and Potential Fields

8. Towards geological inference with process-based and deep generative modeling, part 1: training on fluvial deposits

9. Generating High Dimensional User-Specific Wireless Channels using Diffusion Models

10. GauSSmart: Enhanced 3D Reconstruction through 2D Foundation Models and Geometric Filtering

Graph Neural Networks

1. Vgent: Graph-based Retrieval-Reasoning-Augmented Generation For Long Video Understanding

2. Boosting Graph Foundation Model from Structural Perspective

3. Multimodal RAG for Unstructured Data: Leveraging Modality-Aware Knowledge Graphs with Hybrid Retrieval

4. Learning Wireless Interference Patterns: Decoupled GNN for Throughput Prediction in Heterogeneous Multi-Hop p-CSMA Networks