Abstract
Neural networks have changed the way machines interpret the world. At their
core, they learn by following gradients, adjusting their parameters step by
step until they identify the most discriminative patterns in the data. This
process gives them their strength, yet it also opens the door to a hidden flaw.
The very gradients that help a model learn can also be used to craft small,
imperceptible perturbations, known as adversarial attacks, that leave an image
essentially unchanged to the human eye yet cause the model to make wrong
predictions. In this work, we propose Adversarially-trained Contrastive
Hard-mining for Optimized Robustness (ANCHOR), a framework that leverages
supervised contrastive learning with explicit hard positive mining. The model
learns representations in which the embeddings of an image, its augmentations,
and its perturbed versions cluster together in the embedding space with those
of other images of the same class while remaining separated from images of
other classes. This alignment helps
the model focus on stable, meaningful patterns rather than fragile gradient
cues. On CIFAR-10, our approach achieves strong clean accuracy and strong
robust accuracy under PGD-20 (epsilon = 0.031), outperforming standard
adversarial training methods. Our results indicate that combining adversarial
guidance with hard-mined contrastive supervision helps models learn more
structured and robust representations, narrowing the gap between accuracy and
robustness.
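As a concrete illustration of the gradient-based perturbations described above, below is a minimal PGD sketch in PyTorch. The epsilon = 0.031 and 20-step setting mirrors the PGD-20 evaluation quoted in the abstract; the step size alpha and the function name pgd_attack are assumptions for illustration, not details taken from the paper.

```python
import torch
import torch.nn.functional as F

def pgd_attack(model, images, labels, eps=0.031, alpha=0.007, steps=20):
    """Minimal PGD sketch: eps and steps follow the PGD-20 setting in the
    abstract; alpha is an assumed per-step size, not from the paper."""
    images = images.clone().detach()
    # Start from a random point inside the epsilon-ball (standard PGD init).
    adv = images + torch.empty_like(images).uniform_(-eps, eps)
    adv = torch.clamp(adv, 0.0, 1.0).detach()

    for _ in range(steps):
        adv.requires_grad_(True)
        loss = F.cross_entropy(model(adv), labels)
        grad = torch.autograd.grad(loss, adv)[0]
        # Signed-gradient ascent step, then projection back into the eps-ball.
        adv = adv.detach() + alpha * grad.sign()
        adv = torch.min(torch.max(adv, images - eps), images + eps)
        adv = torch.clamp(adv, 0.0, 1.0)
    return adv.detach()
```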
Authors (3)
Samarup Bhattacharya
Anubhab Bhattacharya
Abir Chakraborty
Submitted
October 31, 2025
Key Contributions
ANCHOR is a framework that combines adversarial training with supervised contrastive learning and explicit hard positive mining to learn robust image representations. This approach aims to make neural networks more resilient to adversarial attacks by ensuring that learned embeddings are discriminative even for subtly perturbed inputs.
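To make the hard-positive contrastive component more concrete, here is a minimal sketch of a supervised contrastive loss that emphasises hard positives, assuming L2-normalised embeddings for clean, augmented, and adversarial views stacked in one batch with labels repeated accordingly. The temperature, the top_k selection of least-similar same-class samples, and the function name are illustrative assumptions; this reconstructs the general technique, not the authors' exact ANCHOR loss.

```python
import torch
import torch.nn.functional as F

def supcon_hard_positive_loss(embeddings, labels, temperature=0.1, top_k=1):
    """Supervised contrastive loss focused on hard positives (sketch)."""
    z = F.normalize(embeddings, dim=1)
    sim = z @ z.t() / temperature                      # pairwise similarities
    n = z.size(0)
    self_mask = torch.eye(n, dtype=torch.bool, device=z.device)
    pos_mask = labels.unsqueeze(0).eq(labels.unsqueeze(1)) & ~self_mask

    # Log-softmax over all non-self pairs (SupCon-style denominator).
    logits = sim.masked_fill(self_mask, float('-inf'))
    log_prob = logits - torch.logsumexp(logits, dim=1, keepdim=True)

    losses = []
    for i in range(n):
        pos_idx = pos_mask[i].nonzero(as_tuple=True)[0]
        if pos_idx.numel() == 0:
            continue
        # Hard positives: same-class samples least similar to the anchor.
        k = min(top_k, pos_idx.numel())
        hard = pos_idx[sim[i, pos_idx].topk(k, largest=False).indices]
        losses.append(-log_prob[i, hard].mean())
    if not losses:
        return embeddings.new_zeros(())
    return torch.stack(losses).mean()
```

In an ANCHOR-style training step, the adversarial views fed into this loss would come from an attack such as the PGD sketch above, so that clean, augmented, and perturbed embeddings of the same class are pulled together while other classes are pushed away.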
Business Value
Increases the reliability and security of AI systems deployed in critical applications, reducing risks associated with malicious manipulation of inputs.