transformers-community/sep_cache
Safetensors, English, llama, custom_generate, arxiv:2412.12094, License: llama3
sep_cache/original/params.json
Gausson: init commit (55427a1, verified), 11 months ago
221 Bytes
{
  "dim": 4096,
  "n_layers": 32,
  "n_heads": 32,
  "n_kv_heads": 8,
  "vocab_size": 128256,
  "multiple_of": 1024,
  "ffn_dim_multiplier": 1.3,
  "norm_eps": 1e-05,
  "rope_theta": 500000.0
}
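
The file stores only a few base hyperparameters, so several tensor shapes have to be derived from them. The sketch below shows one way to do that in Python. The helper name `derive_shapes` is hypothetical and not part of this repository. The feed-forward sizing rule is an assumption: it follows the convention commonly used in Llama-style reference code (take 2/3 of 4 * dim, scale by `ffn_dim_multiplier`, round up to a multiple of `multiple_of`) and should be checked against the code that actually loads this file.

```python
import json

# Contents of params.json, copied from the file shown above.
PARAMS_JSON = """
{"dim": 4096, "n_layers": 32, "n_heads": 32, "n_kv_heads": 8,
 "vocab_size": 128256, "multiple_of": 1024, "ffn_dim_multiplier": 1.3,
 "norm_eps": 1e-05, "rope_theta": 500000.0}
"""


def derive_shapes(params):
    """Hypothetical helper: derive common shapes from the base hyperparameters."""
    # Each attention head works on an equal slice of the model dimension.
    head_dim = params["dim"] // params["n_heads"]

    # With fewer key/value heads than query heads (grouped-query attention),
    # this many query heads share one key/value head.
    queries_per_kv_head = params["n_heads"] // params["n_kv_heads"]

    # Assumed Llama-style feed-forward sizing (not stated in params.json itself).
    hidden = 4 * params["dim"]
    hidden = int(2 * hidden / 3)
    hidden = int(params["ffn_dim_multiplier"] * hidden)
    multiple_of = params["multiple_of"]
    # Round up to the next multiple of `multiple_of`.
    hidden = multiple_of * ((hidden + multiple_of - 1) // multiple_of)

    return {
        "head_dim": head_dim,
        "queries_per_kv_head": queries_per_kv_head,
        "kv_dim": params["n_kv_heads"] * head_dim,
        "ffn_hidden_dim": hidden,
    }


params = json.loads(PARAMS_JSON)
shapes = derive_shapes(params)
for name, value in shapes.items():
    print(f"{name}: {value}")
```

Under these assumptions the values work out to a head dimension of 128, four query heads per key/value head, a key/value projection width of 1024, and a feed-forward hidden size of 14336.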