Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
Xinnan Zhang
XinnanZhang
Follow
quanwei0's profile picture
SiliangZ's profile picture
RedMist137's profile picture
3 followers
·
4 following
AI & ML interests
None yet
Recent Activity
updated
a model
19 days ago
XinnanZhang/Alfworld-3b-sft-obs3-hint2-10k-2epoch
published
a model
19 days ago
XinnanZhang/Alfworld-3b-sft-obs3-hint2-10k-2epoch
updated
a model
29 days ago
XinnanZhang/Alfworld-qwen2.5-3b-it-1-1wo-obs
View all activity
Organizations
XinnanZhang
's models
30
Sort: Recently updated
XinnanZhang/Alfworld-3b-sft-obs3-hint2-10k-2epoch
Text Generation
•
3B
•
Updated
19 days ago
•
52
XinnanZhang/Alfworld-qwen2.5-3b-it-1-1wo-obs
3B
•
Updated
29 days ago
•
13
XinnanZhang/Alfworld-qwen2.5-7b-it-world-3epoch
Text Generation
•
8B
•
Updated
29 days ago
•
30
XinnanZhang/Alfworld-qwen2.5-7b-it-world-4epoch
Text Generation
•
8B
•
Updated
29 days ago
•
16
XinnanZhang/Alfworld-qwen2.5-3b-it-world-2
Text Generation
•
3B
•
Updated
about 1 month ago
•
17
XinnanZhang/Alfworld-qwen2.5-3b-it-obs-2
Text Generation
•
3B
•
Updated
Nov 29
•
58
XinnanZhang/Alfworld-qwen2.5-3b-SFT
Text Generation
•
3B
•
Updated
Nov 29
•
3
XinnanZhang/Alfworld-qwen2.5-3b-it-wo-obs
Text Generation
•
3B
•
Updated
Nov 29
•
18
XinnanZhang/70b_merged_b64_ckpt_13
Text Classification
•
7B
•
Updated
Nov 18
•
5
XinnanZhang/cl-actor-step325
8B
•
Updated
Sep 25
•
7
XinnanZhang/trl_8b_critic_c_8196_mse_epoch10_7b
Text Classification
•
7B
•
Updated
Apr 24
•
7
XinnanZhang/reasoning_qwen_1.5b_iter1_8k
Text Classification
•
2B
•
Updated
Apr 20
•
13
XinnanZhang/reasoning_qwen_1.5b_iter1_2k
Text Classification
•
2B
•
Updated
Apr 18
•
6
XinnanZhang/reasoning_qwen_1.5b_iter0
Text Classification
•
2B
•
Updated
Apr 18
•
14
XinnanZhang/trl_critic_c_8196_70b_mse_epoch10_iter2
Text Classification
•
7B
•
Updated
Mar 31
•
6
XinnanZhang/trl_critic_c_8196_70b_mse_epoch10_iter1
Text Classification
•
7B
•
Updated
Mar 18
•
6
XinnanZhang/trl_8b_critic_c_8196_mse_iter2_epoch10
Text Classification
•
7B
•
Updated
Feb 11
•
8
XinnanZhang/trl_8b_critic_c_8196_mse_iter2_epoch5
Text Classification
•
7B
•
Updated
Feb 11
•
8
XinnanZhang/trl_8b_critic_c_8196_mse_epoch10_iter2
Text Classification
•
7B
•
Updated
Feb 9
•
10
XinnanZhang/trl_8b_critic_c_8196_mse_epoch10
Text Classification
•
7B
•
Updated
Feb 9
•
8
XinnanZhang/trl_critic_c_8196_td_epoch10
Text Classification
•
7B
•
Updated
Jan 30
•
5
XinnanZhang/vllm_critic_c_8196_epoch5
Text Classification
•
7B
•
Updated
Jan 25
•
5
XinnanZhang/vllm_critic_c_8196_epoch10
Text Classification
•
7B
•
Updated
Jan 25
•
8
XinnanZhang/pythia_tldr_ppo_1b_critic_55513
0.9B
•
Updated
Dec 5, 2024
•
6
XinnanZhang/pythia_tldr_ppo_1b_policy_55513
Text Generation
•
1B
•
Updated
Dec 5, 2024
•
12
XinnanZhang/pythia_tldr_ppo_1b_policy_44413
Text Generation
•
1B
•
Updated
Dec 3, 2024
•
5
XinnanZhang/pythia_tldr_ppo_1b_critic_44413
0.9B
•
Updated
Dec 3, 2024
•
9
XinnanZhang/pythia_tldr_ppo_1b_value
Text Classification
•
0.9B
•
Updated
Nov 29, 2024
•
7
XinnanZhang/pythia_tldr_ppo_1b_policy
Text Generation
•
1B
•
Updated
Nov 29, 2024
•
6
XinnanZhang/zephyr-7b-sft-full
Updated
Aug 27, 2024