In a Training Loop 🔄

11 8

Anton Tikhonov

itwastony

itwastony

AI & ML interests

MSc in AI. Research scientist/engineer, building foundation models

Recent Activity

upvoted an article 16 days ago

Ultra-Long Sequence Parallelism: Ulysses + Ring-Attention Technical Principles and Implementation

liked a model 3 months ago

Virtue-AI-HUB/VulnLLM-R-7B

upvoted a paper 3 months ago

ROCKET: Rapid Optimization via Calibration-guided Knapsack Enhanced Truncation for Efficient Model Compression

View all activity

Organizations

upvoted an article 16 days ago

Article

Ultra-Long Sequence Parallelism: Ulysses + Ring-Attention Technical Principles and Implementation

exploding-gradients

•

Sep 16, 2025

• 20

liked a model 3 months ago

Virtue-AI-HUB/VulnLLM-R-7B

Text Generation • 8B • Updated Dec 12, 2025 • 10.6k • • 189

upvoted a paper 3 months ago

ROCKET: Rapid Optimization via Calibration-guided Knapsack Enhanced Truncation for Efficient Model Compression

Paper • 2602.11008 • Published Feb 11 • 18

liked a dataset 7 months ago

MTSAIR/MWS-Vision-Bench

Viewer • Updated 15 days ago • 3.91k • 681 • 20

upvoted a paper 8 months ago

COSPADI: Compressing LLMs via Calibration-Guided Sparse Dictionary Learning

Paper • 2509.22075 • Published Sep 26, 2025 • 23

liked a model 9 months ago

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26, 2025 • 4.63M • • 4.78k

liked a Space 11 months ago

VulnBuster

🛡

AI Security Agent: Multi-MCP Code Vulnerability Scanner

upvoted a paper about 1 year ago

ReplaceMe: Network Simplification via Depth Pruning and Transformer Block Linearization

Paper • 2505.02819 • Published Feb 19 • 26

upvoted an article about 1 year ago

Article

Vision Language Models (Better, faster, stronger)

merve, sergiopaniego, ariG23498, pcuenq, andito

•

May 12, 2025

• 611

upvoted a paper about 1 year ago

Iterative Self-Training for Code Generation via Reinforced Re-Ranking

Paper • 2504.09643 • Published Apr 13, 2025 • 34

upvoted an article over 1 year ago

Article

o3-mini & Deepseek-R1

prithivMLmods

•

Feb 2, 2025

• 24

upvoted a paper over 1 year ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 154

updated a dataset over 1 year ago

itwastony/CryptOQA

Preview • Updated Dec 24, 2024 • 99 • 1

upvoted an article over 1 year ago

Article

Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚

Isayoften

•

Aug 26, 2024

• 89

liked 4 models over 1 year ago

upvoted 2 collections over 1 year ago

LLMs (Multi-verse collection)

Collection

This is a group of our models that are trained using our new training technique • 3 items • Updated Mar 2 • 1

Cotype-Nano

Collection

Small and strong 1.5B models • 4 items • Updated Nov 26, 2024 • 19

Anton Tikhonov

AI & ML interests

Recent Activity

Organizations

itwastony's activity

Ultra-Long Sequence Parallelism: Ulysses + Ring-Attention Technical Principles and Implementation

VulnBuster

Vision Language Models (Better, faster, stronger)

o3-mini & Deepseek-R1

Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚