Category: Uncategorized

  • Run AI Models Locally on Your Computer with Ollama

    A few months ago, while having breakfast with colleagues, the conversation turned to how AI is reshaping work. Suddenly, someone asked a pivotal question: “How do you think the AI revolution is going to impact our planet? It consumes a massive amount of energy.”

    It’s a valid concern. Companies, organizations, and nations feel forced to adopt AI to stay competitive, making its usage inevitable. And while low-carbon energy sources like solar and nuclear power exist, scaling them to meet this demand comes with significant challenges.

    The Case for Local AI

    Fortunately, there is a sustainable alternative. Many capable AI models designed for small-to-medium tasks can run directly on your personal computer. By shifting lighter workloads to local machines, we can collectively reduce the demand on massive, energy-intensive cloud infrastructure. It’s a win-win for productivity and the planet.

    Beyond sustainability, running AI locally offers a major privacy advantage. You don’t have to worry about third-party companies collecting your personal data or injecting ads into your chat experience.

    Previously, tools for running local models were often strictly command-line based and difficult to recommend to casual users. That has changed with Ollama.

    Getting Started with Ollama

    Ollama makes running these models incredibly user-friendly. Here is how to get set up:

    1. Download the App: Visit the Ollama download page to get the installer for your system.
    2. Install & Run: Once installed, Ollama typically downloads Google’s Gemma model by default. At around 3.3GB, it is efficient and runs smoothly on most modern computers.

    Note: You can explore other models and check their file sizes in the Ollama Library.

    The Experience

    Ollama provides a clean desktop interface for chatting with your AI. For those who prefer it, the CLI (Command Line Interface) is also available and robust.

    Ollama desktop app screenshot

    Here is a quick look at the CLI tool:

    Ollama CLI chat

    I guess the only real way to know you’re talking to an AI is that it always insists on having the last word! LOL.

    I have personally tested Mistral AI’s Mistral 7B and Meta’s Llama 3.2 on a Mac Mini M1 with 16GB of unified memory, and both performed very well.
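
    Besides the desktop app and the CLI, Ollama also serves a small HTTP API on localhost, which is handy if you want to call a local model from your own scripts. Here is a minimal TypeScript sketch, assuming Node 18 or newer (for the built-in fetch), Ollama’s default port 11434, and that the llama3.2 model mentioned above has already been pulled:

    ```typescript
    // Minimal sketch: send a prompt to a locally running Ollama instance.
    // Assumes Ollama is listening on its default port (11434) and that
    // the `llama3.2` model has already been pulled.
    async function askLocalModel(prompt: string): Promise<string> {
      const res = await fetch("http://localhost:11434/api/generate", {
        method: "POST",
        headers: { "Content-Type": "application/json" },
        body: JSON.stringify({ model: "llama3.2", prompt, stream: false }),
      });
      if (!res.ok) throw new Error(`Ollama request failed: ${res.status}`);
      const data = (await res.json()) as { response: string };
      return data.response;
    }

    // Example call:
    askLocalModel("Summarize the case for running AI models locally.")
      .then(console.log)
      .catch(console.error);
    ```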

    Local AI is no longer just for developers. It is an accessible, private, cost-friendly, and efficient way to use AI every day.

  • Optimizing Input Fields with Debounce

    Debounce is a powerful technique for optimizing resource-intensive features in input fields, such as search and autocomplete. For example, consider a search feature that displays autocomplete suggestions. It’s inefficient to call the search API for every single character typed because:

    1. A single letter is rarely enough to produce meaningful results.

    2. For multi-word queries, calling the API for each partially typed word is wasteful.

    A very basic solution is to detect “complete words” by splitting on spaces, but that doesn’t help when the user only types one term (with no spaces). Another strategy is sending requests at fixed time intervals while the user is typing, which cuts down on unnecessary calls.
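
    That fixed-interval strategy is usually called throttling. A rough TypeScript sketch of the idea (the function names and interval here are only illustrative):

    ```typescript
    // Rough sketch of the fixed-interval idea (throttling): let at most one
    // call through every `intervalMs`, no matter how fast the user types.
    function throttle<Args extends unknown[]>(
      fn: (...args: Args) => void,
      intervalMs: number
    ): (...args: Args) => void {
      let lastCall = 0;
      return (...args: Args) => {
        const now = Date.now();
        if (now - lastCall >= intervalMs) {
          lastCall = now;
          fn(...args);
        }
      };
    }
    ```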

    However, the most effective approach is Debounce. Debounce ignores rapid, consecutive function calls and waits until the user has paused for a set period; once the user stops typing, a single API request is sent for the final input.
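
    In code, the core idea looks roughly like this. It is only a minimal TypeScript sketch; the #search selector, fetchSuggestions callback, and 300 ms delay are placeholder choices, and the CodePen below has a working demo:

    ```typescript
    // Minimal debounce helper: waits until `delayMs` have passed without
    // another call before invoking `fn`; every new call resets the timer.
    function debounce<Args extends unknown[]>(
      fn: (...args: Args) => void,
      delayMs: number
    ): (...args: Args) => void {
      let timer: ReturnType<typeof setTimeout> | undefined;
      return (...args: Args) => {
        if (timer !== undefined) clearTimeout(timer);
        timer = setTimeout(() => fn(...args), delayMs);
      };
    }

    // Hypothetical wiring: only call the search API after the user
    // pauses typing for 300 ms.
    const searchBox = document.querySelector<HTMLInputElement>("#search");
    const fetchSuggestions = debounce((query: string) => {
      // Replace with your real autocomplete / search API call.
      console.log("Searching for:", query);
    }, 300);

    if (searchBox) {
      searchBox.addEventListener("input", () => fetchSuggestions(searchBox.value));
    }
    ```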

    For a working demo, see the Pen “Debounce for Input Fields” by Paul Merupu (@Paul-Merupu) on CodePen.

    With Debounce in your toolkit, you can make your search and autocomplete features much more efficient and user-friendly. Give it a try, and happy coding!