Advanced Quantization Techniques For Large Language Models

Released 1/2026
With Nayan Saxena
MP4 | Video: h264, 1280x720 | Audio: AAC, 44.1 KHz, 2 Ch
Skill level: Advanced | Genre: eLearning | Language: English + subtitle | Duration: 1h 10m | Size: 130 MB
Master advanced quantization techniques for transformer models, from mathematical foundations to practical applications, maximizing efficiency while preserving model quality.
Course details
Discover cutting-edge quantization techniques for large language models, focusing on the algorithms and optimization strategies that deliver the best performance. Instructor Nayan Saxena begins with the mathematical foundations, then progresses through advanced methods including GPTQ, AWQ, and SmoothQuant, with hands-on examples in Google Colab. Along the way, pick up quick tips on critical concepts such as precision formats, calibration strategies, and evaluation methodologies. Combining theoretical principles with practical applications, this course equips you with in-demand skills to significantly reduce model size and accelerate inference while maintaining model quality.
Skills covered
Large Language Models (LLM), Model Training, Quantization Techniques

