interesting (recent) reads & listens

India's NEXT Economic Crisis: Super El Niño 2026 | Case Study
got a better appreciation for just how impactful Monsoon is for the Indian economy because of this video, and the science that determines it. ~600 million people depend on it.
Clinical Note Bloat Reduction for Efficient LLM Use
clinical notes also suffer from inefficient tokenization because they're out of distribution, similar to underrepresented or non-latin languages.
Event Loops Internals And How Redis Handles Multiple Connects on a Single Thread
processes, like event loops, get access to the data at execution time when the kernel buffer copies data to user space. event loops focus only on tasks with ready data to avoid being blocked.
Theory of Space: Can Foundation Models Construct Spatial Beliefs through Active Exploration?
came across this today, very similar to ARC-AGI-3 in its requirement of active exploration + penalization for inefficiency. perhaps this is the future of agentic benchmarks.
ARC-AGI-3: A New Challenge for Frontier Agentic Intelligence
a truly unique benchmark:
  1. out of distribution test environments
  2. strongly (imo a bit unfairly) penalizes inefficiency even when the final answer is correct
  3. no stated goals or instructions
i wonder if future benchmarks will take grading schemes from this, particularly on the efficiency front.
Terence Tao on Dwarkesh Patel's podcast
a good test of continual learning is to see if a model can take a hard math concept it learned before and still use it in a new chat, even when the idea is hidden inside a bigger problem and phrased totally differently.
Talk on the semantics of "Thinking Traces"
no correlation between length of thinking traces and complexity of the task.
Arpit Bhayani talks about real engineering for 1 hour straight
love the authenticity of this talk: there are no silver bullets in distributed systems, every decision is a tradeoff.
Tata Motors Demerger
the unit economic breakdowns this channel gives in its case studies are always impressive, and the analysis of additional value creation from the demerger in this video is the best example of that.
Life is Poker, Not Chess
an insightful essay on navigating decision making, risk management, and the general randomness of life.
Large Language Models are Geographically Biased
i've gone down somewhat of a rabbit hole on this topic... but i'm just now finding out how deep seated western biases are in these models, even after preference tuning.
Unintended Impacts of LLM Alignment on Global Representation
a truly perplexing finding: pre-alignment, models agree with the Nigerian public opinion the most. post alignment, USA.
Doomprompting Is the New Doomscrolling.
(paraphrased) since writing is thinking, ai will make the "write" and "write-not" into the "think" and "think-not"
SuperBPE: Space Travel for Language Models
tokenization is a balancing act of:
1. bloated vocab sizes due to multi word merges
2. improved token fertility and inference speed
3. preventing suboptimal greedy merges early on
François Chollet: ARC-3 and the Path to AGI
love his skepticism (watch his dwarkesh interview) and relatively high clarity on the meaning of intelligence
"nothing is truly unique; we are surrounded by isomorphisms."
"intelligence is the efficiency with which you operationalize the past to deal with the future"
Multilingual Machine Translation & Evaluation for Indian Languages - @prajdabre at RespAI Lab
great summary on the existing state of multilingual llms. his RomanSetu paper is probably the most fun i've had reading a paper
Andrej Karpathy: Software Is Changing (Again)
1. love the llm ~= os analogy
2. start enabling your tools/projects to be easily used by agents.
3. less humans clicking around to set up your tool, more agents doing it.
The DeepSeek Documentary on Liang Wenfeng, R1 and What's Next
incredible video on the before and after of the deepseek moment in january'25
How Zepto Became India's Fastest Growing Startup
loved the quote 'build so that you get to wake up tomorrow and build again'
In-context Mixing (ICM): Code-mixed Prompts for Multilingual LLMs
around the same time this paper came out, i was playing around with a project i called "interspersed bilingual decoder" that was similar in its random replacement of certain POS. clearly, they have much stronger evals.
Let's build the GPT Tokenizer
video from last year that i rewatched recently. it inspired me to build polyglot.
Do you think that ChatGPT can reason?
takeaway: being skeptical is essential in science
p.s. i wish i took Dr. Rao's courses during my time @ asu
Measuring Entrainment in Spontaneous Code-switched Speech
would love to run the experiments on a code-switched mix of two low-resource languages, plot language pairs against one another to quantify the "universality" of entrainment.
bg2 pod
loved the discussion on how tariffs can breed globally uncompetitive products
Make Something Heavy
an essay on the importance of creating substantial work that i found particularly eye-opening
Sarah Paine: "The War For India" on Dwarkesh Patel's podcast
i attended this live lecture in san francisco, it was incredibly informative
Narendra Modi on Lex Fridman's podcast
his wisdom on focus, meditation, and serving his people is inspiring

Venkat Ramaraju

engineer @ tabapay working on all things payments.

interested in cross-lingual llms.

experience

swe iii @ tabapay jan'23 - present
sde @ amazon lab126 jun'22 - jan'23
swe @ redhat may'21 - jun'22
researcher @ compact x-ray free electron lab jan'21 - may'21
teaching assistant, cse 110 @ asu 3 semesters

education

bachelors in computer science @ arizona state university aug'18 - dec'21
graduated early with a 4.0 GPA

projects

some large efforts, some weekend hacking projects

polydb: a vector database written from scratch in go
trained an embedding model from scratch via sgns + pytorch. apiserver communicates with vector services via grpc. more training runs in progress.

at some point, i may completely overhaul the model and train it to align semantically similar sentences from different languages into similar vector spaces. this would allow users to search various documents in whatever language they would like.
flowcast: an xgboost model that predicts 15-minute net bike flow for lyft bike stations in the bay area
1. trained an xgboost model on 4 years of lyft bike rides to predict net bike flow throughout the day for each station based on weather, day/time and other signals.
2. achieved a MAE of 1.07 on the validation set.
3. built a fullstack app (fastapi + react) to interactively run model inference.
4. need to increase feature vectors, perhaps adding information about ongoing events in the area of the station.
polyglot: a multilingual tokenizer implemented from scratch in go via the byte-pair encoding algorithm
achieves uniform compression and fertility across across 10 diverse scripts. training to achieving 5.0 compression in progress.
venkbot: a personal agentic toolkit to automate mundane tasks in my life
whatsapp chat (twilio) hits my self-hosted server; an llm-backed dispatcher invokes the right mix of bespoke tools + mcps to perform the task.
dataquest.ai: an authenticated ai natural language querying tool for documents, datasets, videos, emails, etc.
this application implements rate limiting, request caching, and connection pooling from scratch.
leverages pinecone, langchain with gpt3.5 turbo, gmail api, youtube transcription api, and stripe api integrations.
whaletracker: realtime whale trade tracker with polymarket websockets
dynamically discovers and subscribes to new markets, computes a spread asymmetry pressure metric, stores data in redis, sends email alerts on threshold breach
fb-finetuned: finetuned gpt-oss-20b on 1.5 yrs of my facebook texts to learn my texting style
1. built an dataset generation agent with langgraph + ollama (llamab3.2b)
2. finetuned gpt-oss-20b with peft + lora (shoutout unsloth!)
3. will create a stt pipeline soon with the finetuned model
zillow-bot: a bot that emails you new weekly zillow postings based on your search criteria
1. rapid api for access to zillow data
2. s3 to store reports
3. weekly cronjob set up with github actions for workflow automation
interspersed bilingual decoder (ibd): a code-mixing decoder-only model fine tuned on top of LLama-3.1.
generated hinglish code-mixed datasets using pos tagging with stanza/spacy, performed sft + dpo for alignment.
agora: stock recommender based on public sentiment
uses VADER sentiment analysis models, daily web scrapers using selenium, yfinance api, xgboost and random forest ensemble models

big shoutout to Dr. Ajay Bansal and his PhD student James Smith for their support in elevating this project.

papers

Forecasting Stock Market Performance: An Ensemble Learning-Based Approach
Journal paper - Nominated by AIKE 2023 chairs for the IJSC 2024 edition
Forecasting Stock Market Performance: An Ensemble Learning-Based Approach
Long paper - AIKE 2023, Published in IEEE Xplore
A Sentiment Analysis Based Stock Recommendation System
Long paper - AIKE 2022, Published in IEEE Xplore
Agora: Introducing the Internet's Opinion to Traditional Stock Analysis and Prediction
Short paper - ICSC 2022, Published in IEEE Xplore