Gurumurthi V Ramanan

GVR

https://surasys.co

AI & ML interests

Recent Activity

liked a model about 8 hours ago

Intel/GLM-4.7-int4-mixed-AutoRound

liked a model about 17 hours ago

delong-chen/VL-JEPA

Organizations

upvoted a paper about 17 hours ago

VL-JEPA: Joint Embedding Predictive Architecture for Vision-language

Paper • 2512.10942 • Published 19 days ago • 13

upvoted a paper 2 days ago

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2512.20848 • Published 7 days ago • 28

upvoted 3 articles 4 days ago

Article

Efficient MultiModal Data Pipeline

Jul 8

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

May 21 • 246

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Sep 11 • 176

upvoted an article 9 days ago

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

13 days ago

upvoted 2 papers 16 days ago

A Survey of Vibe Coding with Large Language Models

Paper • 2510.12399 • Published Oct 14 • 49

Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning

Paper • 2512.10534 • Published 19 days ago • 31

upvoted a paper 17 days ago

One Layer Is Enough: Adapting Pretrained Visual Encoders for Image Generation

Paper • 2512.07829 • Published 22 days ago • 21

upvoted an article 17 days ago

Article

MiniGuard-v0.1: Prem's Guardrail Model Redefining the Pareto Frontier

18 days ago

upvoted 2 papers 18 days ago

CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning

Paper • 2511.18659 • Published Nov 24 • 18

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23 • 278

upvoted a collection 18 days ago

rnj-1

Collection

5 items • Updated 11 days ago • 39

upvoted a paper 18 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 29 days ago • 93

upvoted a paper 19 days ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published 22 days ago • 74

upvoted an article 19 days ago

Article

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

21 days ago

upvoted an article 26 days ago

Article

We Got Claude to Fine-Tune an Open Source LLM

27 days ago • 549

upvoted a collection 28 days ago

BERT-Chat

Collection

BERTs that chat • 2 items • Updated Nov 28 • 12

upvoted a paper about 1 month ago

NVIDIA Nemotron Parse 1.1

Paper • 2511.20478 • Published Nov 25 • 21

upvoted a collection about 1 month ago

Tarka Embed V1

Collection

Efficient DFKD embeddings for language understanding • 5 items • Updated 13 days ago • 6
