You do not have permission to view the full content of this post. Log in or register now.Pinokio is the easiest to integrate/manage your local AI servers
Salamat sa tips.
You do not have permission to view the full content of this post. Log in or register now.Pinokio is the easiest to integrate/manage your local AI servers
Usually, nandyan naman lahat. Binanggit ko lang dahil limited yung LM Studio sa LLMs. VLMs, at embeddings. It can only handle static configurations of online LLM apis or use LM Studio as an endpoint for your other apps using the local models or mixture of both. For dynamic routing of multiple apis from same or different providers, pwede yang i-partner sa litellm - from api collections. May variety na yung LLM choices for specific tasks na di kaya ng local AI (for the hardware at hand). Sa setup mo, pwede na lahat from music generation na suno grade, create your own Filipino TTS voice model, HQ video generation, atbp. sa dami ng China models na lumabas ngayon he he. Intel Core 9 na lang kulang ko with at least 64 GB at a decent RTX card or alternative na compute-only (monitor-less) GPU with 24GB VRAM kahit luma (~10K) for the AI projects...Si misis lang ang problema ko kahit akin yung pondo he he! Putris talaga itong mga "maybahay" na ito.You do not have permission to view the full content of this post. Log in or register now.
Salamat sa tips.
Dagdag mo yung api ng deepseek-3.1 dyan: You do not have permission to view the full content of this post. Log in or register now.DeepSeek v3.1
Usually ginagamit ko sya for asking about networking. Also kapag mag tratranslate EN -> CN, CN -> EN
minsan ko lang nman sya magamit... gpu mainly for editing and rendering 3d ko 'to ginagamitUsually, nandyan naman lahat. Binanggit ko lang dahil limited yung LM Studio sa LLMs. VLMs, at embeddings. It can only handle static configurations of online LLM apis or use LM Studio as an endpoint for your other apps using the local models or mixture of both. For dynamic routing of multiple apis from same or different providers, pwede yang i-partner sa litellm - from api collections. May variety na yung LLM choices for specific tasks na di kaya ng local AI (for the hardware at hand). Sa setup mo, pwede na lahat from music generation na suno grade, create your own Filipino TTS voice model, HQ video generation, atbp. sa dami ng China models na lumabas ngayon he he. Intel Core 9 na lang kulang ko with at least 64 GB at a decent RTX card or alternative na compute-only (monitor-less) GPU with 24GB VRAM kahit luma (~10K) for the AI projects...Si misis lang ang problema ko kahit akin yung pondo he he! Putris talaga itong mga "maybahay" na ito.
Dagdag mo yung api ng deepseek-3.1 dyan: You do not have permission to view the full content of this post. Log in or register now.
(Di ko na nilagyan ng invitation link for the credits.)
May free $100 to $225 free credits upon login. Good to use kung ayaw mong mag-lag pc using other tasks while online.
Ayos naman pala yang hardware mo - nakaka-inggit he he! Marami pang spare VRAM for other tasks. Di ko area yan. Sa data and knowledge processing yung target ko using offline,online AI or mixed to cope with my hardware limitations.minsan ko lang nman sya magamit... gpu mainly for editing and rendering 3d ko 'to ginagamit
5090 user btw
Congrats din sa'yo, bossing, because of your forum, we cannot meet these Pinoy enthusiasts. Gusto ko naman maka connect ako sa mga enthusiast pagdating sa technology. Love reading this good discussion sharing their insights, knowledge and information. Mas iba talaga feeling kung human interaction. na realize ko lang ito nung napanood ko yung podcast ng Rota Wheels from Autoindustriya.com sa YouTube kung pano din sila nag meet and nag establish ng brand and daming connection through forum. Probably ka era mo sila boss draft. hehe so back to topic.
Nag setup ako ng gemma e2b sa low end spec ng work pc ko haha ang tagal mag results xD
Run mo dapat sa GPU, iGPU, or NPU para medyo mabilisNag setup ako ng gemma e2b sa low end spec ng work pc ko haha ang tagal mag results xD
.Tinry ko sakin, ambilis

Kung talagang low-end tulad ng akin he he, kaya mong pabilisin yan ng kaunti gamit yung ONNX model (Q4_K_M) na 3.2GB. Sa CPU mode mga 8-10 tokens/sec sa old 3rd-gen Quad ko. Pasok pa rin sa normal reading speed ng tao na 5 - 8 tokens/sec (~6 words/sec). May initial delay lang talaga bago sumagot kahit sa 4GB GPU, lalo pa sa CPU mode. May mga nabasa akong setup using gemma e2b ONNX models for ARM like Rasbberry Pi5 na 8 - 12 tokens/sec.Nag setup ako ng gemma e2b sa low end spec ng work pc ko haha ang tagal mag results xD
5090 ba ito?

yung GPU ko is GeForce 1050 Ti lang hahaRun mo dapat sa GPU, iGPU, or NPU para medyo mabilis.
anong prompt at gpu boss? hahaTinry ko sakin, ambilisView attachment 4132257
Ganda nya rin gamitin compare sa ibang models na nakainstall sakin.
Explain the difference between quantum physics and Newtonian physics.
salamat po boss draftYou do not have permission to view the full content of this post. Log in or register now.
Salamat sa tips.