YAML Metadata Error:"widget[0]" must be of type object

YAML Metadata Error:"widget[1]" must be of type object

Wenzhong-GPT2-110M

็ฎ€ไป‹ Brief Introduction

ๅ–„ไบŽๅค„็†NLGไปปๅŠก๏ผŒไธญๆ–‡็‰ˆ็š„GPT2-Smallใ€‚

Focused on handling NLG tasks, Chinese GPT2-Small.

ๆจกๅž‹ๅˆ†็ฑป Model Taxonomy

้œ€ๆฑ‚ Demand ไปปๅŠก Task ็ณปๅˆ— Series ๆจกๅž‹ Model ๅ‚ๆ•ฐ Parameter ้ขๅค– Extra
้€š็”จ General ่‡ช็„ถ่ฏญ่จ€็”Ÿๆˆ NLG ้—ปไปฒ Wenzhong GPT2 110M ไธญๆ–‡ Chinese

ๆจกๅž‹ไฟกๆฏ Model Information

็ฑปไผผไบŽWenzhong2.0-GPT2-3.5B-chinese๏ผŒๆˆ‘ไปฌๅฎž็Žฐไบ†ไธ€ไธชsmall็‰ˆๆœฌ็š„12ๅฑ‚็š„Wenzhong-GPT2-110M๏ผŒๅนถไธ”ๅœจๆ‚Ÿ้“๏ผˆ300G็‰ˆๆœฌ๏ผ‰ไธŠ้ข่ฟ›่กŒ้ข„่ฎญ็ปƒใ€‚

Similar to Wenzhong2.0-GPT2-3.5B-chinese, we implement a small size Wenzhong-GPT2-110M with 12 layers, which is pre-trained on Wudao Corpus (300G version).

ไฝฟ็”จ Usage

ๅŠ ่ฝฝๆจกๅž‹ Loading Models

from transformers import GPT2Tokenizer,GPT2LMHeadModel
hf_model_path = 'IDEA-CCNL/Wenzhong-GPT2-110M'
tokenizer = GPT2Tokenizer.from_pretrained(hf_model_path)
model = GPT2LMHeadModel.from_pretrained(hf_model_path)

ไฝฟ็”จ็คบไพ‹ Usage Examples

question = "ๅŒ—ไบฌๆ˜ฏไธญๅ›ฝ็š„"
inputs = tokenizer(question,return_tensors='pt')
generation_output = model.generate(**inputs,
                                return_dict_in_generate=True,
                                output_scores=True,
                                max_length=150,
                                # max_new_tokens=80,
                                do_sample=True,
                                top_p = 0.6,
                                # num_beams=5,
                                eos_token_id=50256,
                                pad_token_id=0,
                                num_return_sequences = 5)

for idx,sentence in enumerate(generation_output.sequences):
    print('next sentence %d:\n'%idx,
    tokenizer.decode(sentence).split('<|endoftext|>')[0])
    print('*'*40)

ๅผ•็”จ Citation

ๅฆ‚ๆžœๆ‚จๅœจๆ‚จ็š„ๅทฅไฝœไธญไฝฟ็”จไบ†ๆˆ‘ไปฌ็š„ๆจกๅž‹๏ผŒๅฏไปฅๅผ•็”จๆˆ‘ไปฌ็š„่ฎบๆ–‡๏ผš

If you are using the resource for your work, please cite the our paper:

@article{fengshenbang,
  author    = {Jiaxing Zhang and Ruyi Gan and Junjie Wang and Yuxiang Zhang and Lin Zhang and Ping Yang and Xinyu Gao and Ziwei Wu and Xiaoqun Dong and Junqing He and Jianheng Zhuo and Qi Yang and Yongfeng Huang and Xiayu Li and Yanghan Wu and Junyu Lu and Xinyu Zhu and Weifeng Chen and Ting Han and Kunhao Pan and Rui Wang and Hao Wang and Xiaojun Wu and Zhongshen Zeng and Chongpei Chen},
  title     = {Fengshenbang 1.0: Being the Foundation of Chinese Cognitive Intelligence},
  journal   = {CoRR},
  volume    = {abs/2209.02970},
  year      = {2022}
}

ไนŸๅฏไปฅๅผ•็”จๆˆ‘ไปฌ็š„็ฝ‘็ซ™:

You can also cite our website:

@misc{Fengshenbang-LM,
  title={Fengshenbang-LM},
  author={IDEA-CCNL},
  year={2021},
  howpublished={\url{https://github.com/IDEA-CCNL/Fengshenbang-LM}},
}
Downloads last month
1,951
Safetensors
Model size
0.1B params
Tensor type
F16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for IDEA-CCNL/Wenzhong-GPT2-110M

Quantizations
2 models

Spaces using IDEA-CCNL/Wenzhong-GPT2-110M 12

Paper for IDEA-CCNL/Wenzhong-GPT2-110M