Simple Model Choice Checklist

1. Be clear about the problem

What are we trying to predict or find?
What does “good enough” look like?
What happens if we get it wrong?

2. Start with the simplest thing

For predictions: try a small, simple model first.
For search: try keyword search (BM25) with a few improvements.
Build something fast and easy.
If the simple version hits the goal → stop. You’re done.

3. If the simple model fails, ask why

Bad data?
Fix the data before touching the model.

Missing information?
Add better features, fields, or structured data.

Model too weak?
Only then consider upgrading.

4. Only use bigger models if you REALLY need them

Move to advanced models only if:

The patterns are too complicated for a simple model
You must hit a very high accuracy target
Mistakes are expensive
You have lots of data
Users ask vague questions and you need to understand meaning

If you can’t tick at least two of these → stay simple.

5. Check the real-world impact

Is it cheap enough to run?
Is it fast enough?
Can we monitor it easily?
Can the team look after it?
Is it worth the extra complexity?

6. Improve in the cheapest order

Before using a big AI model:

Clean your data
Improve your features or chunking
Combine a few simple models
Then — only then — use the heavy stuff

Why This Matters

Most engineering teams over-engineer their ML solutions. They jump straight to LLMs, transformers, or complex neural networks when a simple logistic regression or BM25 search would solve the problem perfectly.

Start simple. Upgrade only when you have evidence you need to.

This checklist keeps you focused on solving the actual problem, not just using fancy technology.