Data Science

2 months ago

Statistics made simple

- I have a weird relationship with statistics: on one hand, I try not to look at it too often. Maybe once or twice a year. It’s because analytics is not actionable: what difference does it make if a thousand people saw my article or ten thousand? I mean, sure, you might try to...

Backend dev Data Science

3 months ago

Free course: Cheat at Search Essentials

- A free introductory search course for anyone who wants better search without all the hard work...

AI / LLMs Data Science

Under the hood of Canada Spends with Brendan Samek

Backend dev Data Science

Highlights from my appearance on the Data Renegades podcast with CL Kao and Dori Wilson

AI / LLMs Data Science

4 months ago

LLM Judges aren’t the shortcut you think

- After the LLM judge hype curve crashes, what will come after?...

AI / LLMs Data Science

Bayesian A/B testing is not immune to peeking

Data Science

5 months ago

Reasoning boosts search relevance 15-30%

- Kicking the tires on an initial, naive agentic search with some thoughts on how it could be improved further...

AI / LLMs Data Science

6 months ago

My review of Claude's new Code Interpreter, released under a very confusing name

AI / LLMs Data Science

Recreating the Apollo AI adoption rate chart with GPT-5, Python and Pyodide

AI / LLMs Data Science

Wiggling Into Correlation

- Jeff Kaufman shared some data around contra dance attendance as a function of requirements on wearing surgical masks. He compares this data to survey data, which is a useful way to validate in both directions. I found the plot compelling for a different reason – depending on how...

Data Science

My take on blog analytics

- I recently read You do not need “analytics” for your blog because you are neither a military surveillance unit nor a commodity trading company by Leon Paternoster. It’s a well-argued piece, and I agree with the general thrust… but I also won’t be removing analytics from my site...

Data Science Security

On Perfectionism in Data

Data Science Design

7 months ago

From 3 TB RAM to 96 GB: superseding billion vector HNSW with 40x cheaper DiskANN

- An analysis of DiskANN, a newer graph-based ANN index built for cheaper disk while still retaining high recall and throughput....

Databases Data Science

You Have Too Many Metrics

- Metrics can be incredibly powerful. But you have too many of them. Let’s talk about how and when to use metrics. The golden rule of metrics is this: any metric you maintain should directly drive action if outside expected bounds. The reason this is an important...

Data Science

First impressions from testing 4 Coding Agents with Jupyter Notebooks

- Say what you will about Jupyter Notebooks, but I think they are an incredible medium for learning and quick experimentation. I use Jupyter Notebooks all the time for my work and personal use. So, naturally, I was curious when I read that you could use Claude Code with Jupyter...

AI / LLMs Data Science

8 months ago

Follow Up: An Analysis of YouTube Links From The White House’s “Wire” Website

- After publishing my Analysis of Links From The White House’s “Wire” Website, Tina Nguyen, political correspondent at The Verge, reached out with some questions. Her questions made me realize that the numbers in my analysis weren’t quite correct (I wasn’t de-duplicating links...

Data Science

Various Hill Plots

- We begin with the ever intrusive normal distribution. Its Hill plot resembles the first half of a cycloid or something. Increasing the variance of the distribution does not change anything about the Hill plot. Changing its mean does not change the shape of the plot, but it...

Data Science

An Analysis of Links From The White House’s “Wire” Website

- A little while back I heard about the White House launching their version of a Drudge Report style website called White House Wire. According to Axios, a White House official said the site’s purpose was to serve as “a place for supporters of the president’s agenda to get the...

Data Science

9 months ago

Let the Model Write the Prompt

- Why Applications & Pipelines Should Use DSPy. Below is a talk I delivered at the 2025 Data and AI Summit, focusing on how to use DSPy to define and optimize your LLM tasks. We use a toy geospatial conflation problem – the challenge of determining if two datapoints refer to the...

AI / LLMs Data Science

Be wary of high variance NDCG changes

- Which looks like a better change, this NDCG bump of 0.005 on baseline? In [1]: ndcgs(baseline).mean(), ndcgs(graded_syns).mean() Out[1]: (np.float64(0.5411098691836396), np.float64(0.5461684655797919)) Or the NDCG bump of 0.01 on top of baseline? In [1]: ndcgs(baseline).mean(),...

AI / LLMs Data Science

Should I still use analytics?

- I set up Google Analytics on my site in 2010 and have used it since then to track page views. I only care about page views, which I find useful to figure out which pages get the most traffic. It’s interesting data, and sometimes rather useful. But Google collects much more...

Data Science General tech

Liberating search from the search engine

- Modern search engines push waaay too much complexity into the engine. Frustrating search practitioners. Let’s stop doing that. Let’s just get the top N from the search engine, and boost/rerank/etc in our API code. Using tools we know and love. Elasticsearch, Vespa, Weaviate, and...

Data Science

A Small Model Just for Structured Output

- Osmosis-Structure-0.6B is a small model trained with reinforcement learning to do one thing well: extract structured data, typically JSON, from unformatted text. That’s it! Convincing LLMs to consistently produce JSON or specifically tagged answers has been a headache since...

AI / LLMs Data Science

10 months ago

RAG's big blindspot

- One huge gap I see in the RAG community is an overemphasis on human (or LLM) evals and a lack of engagement-based evals (i.e., clicks, conversions). Maybe RAG apps are too early in the build phase to have tons of live users? Or, as I suspect, it’s just a hard problem? Let’s...

AI / LLMs Data Science
