Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
yiwenX
/
MiniMind-MoE-640-120M
like
0
Text Generation
Transformers
PyTorch
Safetensors
Chinese
English
minimind
Mixture of Experts
mixture-of-experts
chinese
conversational
causal-lm
custom_code
arxiv:
1701.06538
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
MiniMind-MoE-640-120M
368 MB
1 contributor
History:
5 commits
yiwenX
Upload README.md with huggingface_hub
3255d25
verified
2 months ago
.gitattributes
1.52 kB
initial commit
2 months ago
README.md
8.57 kB
Upload README.md with huggingface_hub
2 months ago
chat_template.jinja
3.34 kB
Upload folder using huggingface_hub
2 months ago
config.json
905 Bytes
Upload folder using huggingface_hub
2 months ago
generation_config.json
111 Bytes
Upload folder using huggingface_hub
2 months ago
model.safetensors
77.7 MB
xet
Upload folder using huggingface_hub
2 months ago
model_minimind.py
22 kB
Upload folder using huggingface_hub
2 months ago
pytorch_model.bin
290 MB
xet
Upload folder using huggingface_hub
2 months ago
special_tokens_map.json
583 Bytes
Upload folder using huggingface_hub
2 months ago
tokenizer.json
440 kB
Upload folder using huggingface_hub
2 months ago
tokenizer_config.json
1.03 kB
Upload folder using huggingface_hub
2 months ago