Advanced Quantization Techniques For Large Language Models

af6300aaaea7e89c01b6410fd2245c86.jpg

Advanced Quantization Techniques For Large Language Models
Released 1/2026
With Nayan Saxena
MP4 | Video: h264, 1280x720 | Audio: AAC, 44.1 KHz, 2 Ch
Skill level: Advanced | Genre: eLearning | Language: English + subtitle | Duration: 1h 10m | Size: 130 MB​
Master advanced quantization techniques for transformer models, from mathematical foundations to practical applications, maximizing efficiency while preserving model quality.
Course details
Discover cutting-edge quantization techniques for large language models, focusing on the algorithms and optimization strategies that deliver the best performance. Instructor Nayan Saxena begins by covering mathematical foundations, before progressing through advanced methods including GPTQ, AWQ, and SmoothQuant with hands-on examples in Google Colab. Along the way, gather quick tips to master critical concepts such as precision formats, calibration strategies, and evaluation methodologies. Leveraging both theoretical principles and practical applications, this course equips you with in-demand skills to significantly reduce model size and accelerate inference while maintaining performance quality.
Skills covered
Large Language Models (LLM), Model Training, Quantization Techniques


You do not have permission to view the full content of this post. Log in or register now.
 

Similar threads

About this Thread

  • 0
    Replies
  • 99
    Views
  • 1
    Participants
Last reply from:
angilok50

Online now

Members online
1,106
Guests online
1,710
Total visitors
2,816

Forum statistics

Threads
2,278,116
Posts
28,980,983
Members
1,228,230
Latest member
yajieme
Back
Top