Buti ka pa pakner, naka-setup na with a decent GPU he he. Sa iba napunta yung pang-hobby ko.
Ako, sa local AI, Llama.cpp, Ollama.....sa cpu mode pa rin dahil sa hardware limitations kaya limited sa <10B parameters sa LLM na 4-8bit, GGUF. Nag-install (using Windows installer) sa'kin yung LMStudio pero di stable siya sa'kin dahil hanggang AVX lang yung akin. AVX2 kasi ang requirement ng LMStudio kaya siguro nagkaganoon. Mas ok sa'kin yung Llama.cpp. Ang maganda sa LMStudio, meron na siyang matinong UI, pwede mong lagyan ng MCP server (local or galing sa HF, etc.) or as mcp client, may ibat' ibang user mode (user, power user, developer)....Mas madaling basahin yung docs via github:
You do not have permission to view the full content of this post.
Log in or register now.
Sa many ways of using it, go here:
You do not have permission to view the full content of this post.
Log in or register now.
Sa cli-mode, ok siya noong una kong gamitin.
Hindi ko na ginalugad yung ibang features, pero assuming you have the right hardware and knowhow, it works as stated. Meron ka ng LLM api server at the least for your other apps like sa browser, other OpenAI API compatible interfaces (open-webui, SillyTavern...), etc.
Sa'kin, naglalaro ako sa compatible na Gemma, Llama or Deepseek. Meron namang online/offline calculator to verify if your existing hardware can run the models you want. Sapilitan na lang ang local AI kung nasa remote area ako tulad ng probinsiya he he. Basta dala ko yung USB drive, any compatible pc will do.
Mas maganda talaga pag RTX cards ang gamit dahil widely supported yan ng AI program, and be sure >16GB man lang to run them smoothly. Yung Intel Arc graphics is still a good choice since nakahabol na sila sa AI requirements. Kahit nga Intel Core processor with NPU can benefit, but you need to find the app that supports it.
Sa setup mo, yang GPT-OSS-20B is the best choice. Kita naman sa LLM leaderboard yung benchmark niya. Sa pagpili, madali na basta alam mo yung full details ng model card niya.
PS. Kung expire na warranty ng mini-pc mo, taasan mo yung RAM kahit hanggang 64GB or higher. Ito yung nabasa ko to support this claim for LM studio use:
Top Tools for Hybrid CPU/GPU Workloads
- You do not have permission to view the full content of this post.
Log in or register now.: A popular desktop interface that includes a "GPU Offload" slider. It allows you to specify exactly how many layers of an LLM should run on the GPU; any remaining layers automatically run on the CPU. In late 2025 versions, it added advanced settings to specifically offload "expert weights" for Mixture of Experts (MoE) models to the CPU while keeping core tasks on the GPU.
Yan yung ibig kong sabihin!