Various pretrained models for analyzing documents. These need to be fine-tuned for a task
Nicholas Broad
nbroad
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 1 hour ago
nbroad/apigen-with-thinking-1.5k
published
a dataset
about 3 hours ago
nbroad/apigen-with-thinking-1.5k
updated
a dataset
about 6 hours ago
nbroad/hf-inference-providers-data
Organizations
summarization
Models, papers, datasets for summarization
-
From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting
Paper • 2309.04269 • Published • 33 -
Benchmarking Large Language Models for News Summarization
Paper • 2301.13848 • Published • 1 -
google/pegasus-xsum
Summarization • Updated • 129k • • 213 -
google/pegasus-x-large
Updated • 79 • 20
financial 💰
models, datasets, spaces, papers related to financial use cases
-
human-centered-summarization/financial-summarization-pegasus
Summarization • 0.6B • Updated • 59.5k • • 140 -
ProsusAI/finbert
Text Classification • Updated • 2.26M • • 1.06k -
nbroad/ESG-BERT
Text Classification • 0.1B • Updated • 820 • 70 -
takala/financial_phrasebank
Updated • 11.2k • 242
pretraining
Document Models (Fine-tuned)
-
naver-clova-ix/donut-base-finetuned-cord-v2
Image-to-Text • Updated • 19.8k • 113 -
google/pix2struct-docvqa-base
Visual Question Answering • 0.3B • Updated • 2.05k • 42 -
google/pix2struct-docvqa-large
Visual Question Answering • Updated • 454 • 32 -
google/pix2struct-screen2words-base
Visual Question Answering • Updated • 167 • 24
attention and long context
-
Efficient Streaming Language Models with Attention Sinks
Paper • 2309.17453 • Published • 14 -
Effective Long-Context Scaling of Foundation Models
Paper • 2309.16039 • Published • 30 -
allenai/longformer-base-4096
Updated • 1.81M • 220 -
google/bigbird-roberta-base
Updated • 27.6k • 59
Code Models
Models for generating and analyzing code
Detect AI Generated Text
A collection of papers about detecting text generated by AI
-
DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature
Paper • 2301.11305 • Published • 2 -
Ghostbuster: Detecting Text Ghostwritten by Large Language Models
Paper • 2305.15047 • Published • 2 -
DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text
Paper • 2306.05540 • Published -
A Survey on LLM-generated Text Detection: Necessity, Methods, and Future Directions
Paper • 2310.14724 • Published • 1
Document Models (Pretrained)
Various pretrained models for analyzing documents. These need to be fine-tuned for a task
Document Models (Fine-tuned)
-
naver-clova-ix/donut-base-finetuned-cord-v2
Image-to-Text • Updated • 19.8k • 113 -
google/pix2struct-docvqa-base
Visual Question Answering • 0.3B • Updated • 2.05k • 42 -
google/pix2struct-docvqa-large
Visual Question Answering • Updated • 454 • 32 -
google/pix2struct-screen2words-base
Visual Question Answering • Updated • 167 • 24
summarization
Models, papers, datasets for summarization
-
From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting
Paper • 2309.04269 • Published • 33 -
Benchmarking Large Language Models for News Summarization
Paper • 2301.13848 • Published • 1 -
google/pegasus-xsum
Summarization • Updated • 129k • • 213 -
google/pegasus-x-large
Updated • 79 • 20
attention and long context
-
Efficient Streaming Language Models with Attention Sinks
Paper • 2309.17453 • Published • 14 -
Effective Long-Context Scaling of Foundation Models
Paper • 2309.16039 • Published • 30 -
allenai/longformer-base-4096
Updated • 1.81M • 220 -
google/bigbird-roberta-base
Updated • 27.6k • 59
financial 💰
models, datasets, spaces, papers related to financial use cases
-
human-centered-summarization/financial-summarization-pegasus
Summarization • 0.6B • Updated • 59.5k • • 140 -
ProsusAI/finbert
Text Classification • Updated • 2.26M • • 1.06k -
nbroad/ESG-BERT
Text Classification • 0.1B • Updated • 820 • 70 -
takala/financial_phrasebank
Updated • 11.2k • 242
Code Models
Models for generating and analyzing code
pretraining
Detect AI Generated Text
A collection of papers about detecting text generated by AI
-
DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature
Paper • 2301.11305 • Published • 2 -
Ghostbuster: Detecting Text Ghostwritten by Large Language Models
Paper • 2305.15047 • Published • 2 -
DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text
Paper • 2306.05540 • Published -
A Survey on LLM-generated Text Detection: Necessity, Methods, and Future Directions
Paper • 2310.14724 • Published • 1