LM Studio users, what are your current local LLM models?

Draft · Jan 7, 2026

alist1986 said:
Pinokio is the easiest to integrate/manage your local AI servers

You do not have permission to view the full content of this post. Log in or register now.

Salamat sa tips.

PHC-TheGlock · Mar 3, 2026

DeepSeek v3.1
Usually ginagamit ko sya for asking about networking. Also kapag mag tratranslate EN -> CN, CN -> EN

alist1986 · Mar 3, 2026

Draft said:
You do not have permission to view the full content of this post. Log in or register now.

Salamat sa tips.

Usually, nandyan naman lahat. Binanggit ko lang dahil limited yung LM Studio sa LLMs. VLMs, at embeddings. It can only handle static configurations of online LLM apis or use LM Studio as an endpoint for your other apps using the local models or mixture of both. For dynamic routing of multiple apis from same or different providers, pwede yang i-partner sa litellm - from api collections. May variety na yung LLM choices for specific tasks na di kaya ng local AI (for the hardware at hand). Sa setup mo, pwede na lahat from music generation na suno grade, create your own Filipino TTS voice model, HQ video generation, atbp. sa dami ng China models na lumabas ngayon he he. Intel Core 9 na lang kulang ko with at least 64 GB at a decent RTX card or alternative na compute-only (monitor-less) GPU with 24GB VRAM kahit luma (~10K) for the AI projects...Si misis lang ang problema ko kahit akin yung pondo he he! Putris talaga itong mga "maybahay" na ito.

PHC-TheGlock said:
DeepSeek v3.1
Usually ginagamit ko sya for asking about networking. Also kapag mag tratranslate EN -> CN, CN -> EN

Dagdag mo yung api ng deepseek-3.1 dyan: You do not have permission to view the full content of this post. Log in or register now.
(Di ko na nilagyan ng invitation link for the credits.)
May free $100 to $225 free credits upon login. Good to use kung ayaw mong mag-lag pc using other tasks while online.

PHC-TheGlock · Mar 3, 2026

alist1986 said:
Usually, nandyan naman lahat. Binanggit ko lang dahil limited yung LM Studio sa LLMs. VLMs, at embeddings. It can only handle static configurations of online LLM apis or use LM Studio as an endpoint for your other apps using the local models or mixture of both. For dynamic routing of multiple apis from same or different providers, pwede yang i-partner sa litellm - from api collections. May variety na yung LLM choices for specific tasks na di kaya ng local AI (for the hardware at hand). Sa setup mo, pwede na lahat from music generation na suno grade, create your own Filipino TTS voice model, HQ video generation, atbp. sa dami ng China models na lumabas ngayon he he. Intel Core 9 na lang kulang ko with at least 64 GB at a decent RTX card or alternative na compute-only (monitor-less) GPU with 24GB VRAM kahit luma (~10K) for the AI projects...Si misis lang ang problema ko kahit akin yung pondo he he! Putris talaga itong mga "maybahay" na ito.

Dagdag mo yung api ng deepseek-3.1 dyan: You do not have permission to view the full content of this post. Log in or register now.
(Di ko na nilagyan ng invitation link for the credits.)
May free $100 to $225 free credits upon login. Good to use kung ayaw mong mag-lag pc using other tasks while online.

minsan ko lang nman sya magamit... gpu mainly for editing and rendering 3d ko 'to ginagamit
5090 user btw

alist1986 · Mar 4, 2026

PHC-TheGlock said:
minsan ko lang nman sya magamit... gpu mainly for editing and rendering 3d ko 'to ginagamit
5090 user btw

Ayos naman pala yang hardware mo - nakaka-inggit he he! Marami pang spare VRAM for other tasks. Di ko area yan. Sa data and knowledge processing yung target ko using offline,online AI or mixed to cope with my hardware limitations.

Maganda ngang tanungin yang Deepseek-3.1-37B, kahit yung Qwen2.5-Coder-32B for networking analysis. Makakagawa ka ng pamalit sa E×ρréššVPN w/out spending a dime, with matching browser specifics for a specific purpose, or solve any complex networking scenarios you can think of with just a prompt - CCNA certified na sila he he. Mataas na yung proficiency nila sa networking.

Draft · Mar 4, 2026

PHC-TheGlock said:
5090 user btw

Congrats, Bossing!

Skia Monarch · Mar 5, 2026

Draft said:
View attachment 4083762

Congrats, Bossing!

Congrats din sa'yo, bossing, because of your forum, we cannot meet these Pinoy enthusiasts. Gusto ko naman maka connect ako sa mga enthusiast pagdating sa technology. Love reading this good discussion sharing their insights, knowledge and information. Mas iba talaga feeling kung human interaction. na realize ko lang ito nung napanood ko yung podcast ng Rota Wheels from Autoindustriya.com sa YouTube kung pano din sila nag meet and nag establish ng brand and daming connection through forum. Probably ka era mo sila boss draft. hehe so back to topic.

I want to hear more from you guys about taking your AI practice into using Agents and Automation. kung stabilize lang sana bansa natin and united, mas may potential pa ako makikita na maka invest tayo more sa hardware and maka on par din sa first world countries.

Draft · Apr 10, 2026

Using this model for a few days now: google/gemma-4-26b-a4b

Skia Monarch · Apr 11, 2026

Draft said:
Using this model for a few days now: google/gemma-4-26b-a4b

View attachment 4130622

Nag setup ako ng gemma e2b sa low end spec ng work pc ko haha ang tagal mag results xD

Draft · Apr 11, 2026

Skia Monarch said:
Nag setup ako ng gemma e2b sa low end spec ng work pc ko haha ang tagal mag results xD

Run mo dapat sa GPU, iGPU, or NPU para medyo mabilis

.

PHC-TheGlock · Apr 11, 2026

Draft said:
Using this model for a few days now: google/gemma-4-26b-a4b

View attachment 4130622

Tinry ko sakin, ambilis

Ganda nya rin gamitin compare sa ibang models na nakainstall sakin.

alist1986 · Apr 11, 2026

Skia Monarch said:
Nag setup ako ng gemma e2b sa low end spec ng work pc ko haha ang tagal mag results xD

Kung talagang low-end tulad ng akin he he, kaya mong pabilisin yan ng kaunti gamit yung ONNX model (Q4_K_M) na 3.2GB. Sa CPU mode mga 8-10 tokens/sec sa old 3rd-gen Quad ko. Pasok pa rin sa normal reading speed ng tao na 5 - 8 tokens/sec (~6 words/sec). May initial delay lang talaga bago sumagot kahit sa 4GB GPU, lalo pa sa CPU mode. May mga nabasa akong setup using gemma e2b ONNX models for ARM like Rasbberry Pi5 na 8 - 12 tokens/sec.

Draft · Apr 12, 2026

PHC-TheGlock said:

5090 ba ito?

Skia Monarch · Apr 13, 2026

Draft said:
Run mo dapat sa GPU, iGPU, or NPU para medyo mabilis .

yung GPU ko is GeForce 1050 Ti lang haha

xLynx · Apr 22, 2026

PHC-TheGlock said:
Tinry ko sakin, ambilisView attachment 4132257

Ganda nya rin gamitin compare sa ibang models na nakainstall sakin.

anong prompt at gpu boss? haha

xLynx · Apr 22, 2026

Ito sakin 11.46tok/s

gemma4:26b
5060ti 16Gb
openwebui/ollama

Eto prompt:

Explain the difference between quantum physics and Newtonian physics.

Click to expand...

X_Space_X · Apr 24, 2026

Draft said:
You do not have permission to view the full content of this post. Log in or register now.

Salamat sa tips.

salamat po boss draft

Search

Search

LM Studio users, what are your current local LLM models?

Draft

PHC-TheGlock

alist1986

Forum Guru

PHC-TheGlock

alist1986

Forum Guru

Draft

Skia Monarch

Draft

Skia Monarch

Draft

PHC-TheGlock

alist1986

Forum Guru

Draft

Skia Monarch

xLynx

xLynx

X_Space_X

Journeyman

About this Thread

New Topics

100% FREE AI That Turns Text Into Real CAD Models

100% FREE Local OCR AI That Runs on Your PC

Google's Gemini 3.6 Flash, 3.5 Flash-Lite & 3.5 Flash Cyber: A Leaner, Agent-Ready Model Lineup

100% FREE Open-Source AI Agent Framework That Runs on Your PC

LF Talkpal po kahit trial lang

FREE $4,000 API Credits - GLM 5.2, DeepSeek, KIMI Free - No Card Needed, Register lang!

ExtremeRouter: Upgraded Version of 9Router

Best ai as of now

FREE $150 API CREDIT - gpt-5.5, claude-opus-4-6, 7, and 8, and glm-5.2

Trick kung paano malaman ang next Free Codex reset

Trending Topics

Online now

Forum statistics