Jailbreak attack datasets generated against multiple LLMs, one dataset per attack method.
AI & ML interests
None defined yet.
Recent Activity
View all activity
models 7
deepkeep-ai/openai-privacy-filter
Token Classification • 1B • Updated • 57
deepkeep-ai/stable-diffusion-xl-1.0-inpainting-0.1-9
Updated • 61
deepkeep-ai/napguard-patch-detector-3
Updated • 63
deepkeep-ai/sac-patch-segmenter-2
Updated • 74
deepkeep-ai/Ministral-3-8B-Instruct-2512
9B • Updated • 12k
deepkeep-ai/sae-guard-gemma3-4b-english-expanded
Feature Extraction • Updated • 1
deepkeep-ai/sae-guard-gemma3-4b-english-research
Feature Extraction • 1 • Updated • 10 • 1
datasets 8
deepkeep-ai/semantic-encoding-data-splits-llm-korean
Viewer • Updated • 16.5k • 27
deepkeep-ai/jigsaw_toxic_not_harmful_5k
Viewer • Updated • 5k • 26
deepkeep-ai/jigsaw_toxic_not_harmful_5k_translated
Viewer • Updated • 5k • 30
deepkeep-ai/notinject_expanded_1k_qwen35_9b_cuda_translated_roleplay
Viewer • Updated • 1k • 121
deepkeep-ai/seq_cls_train_translated_v3
Viewer • Updated • 2.15k • 22
deepkeep-ai/datasets
Updated • 18
deepkeep-ai/AdvBench-gcg
Viewer • Updated • 268 • 9
deepkeep-ai/benchoverflow
Viewer • Updated • 2.98k • 3