This course focuses on recent advances in the design of efficient neural networks: how to create and optimize AI models for better performance, scalability, and resource efficiency. Students will explore key techniques such as model compression, pruning, quantization, and distillation for CNNs, RNNs, Transformers, and LLMs, all aimed at reducing computational complexity and memory usage while preserving accuracy. The course also covers efficient training and inference methods, including distributed computing, parallelism, and low-precision computation, which are crucial for deploying AI on resource-limited devices such as smartphones and edge computing systems. Finally, students will study hardware architectures for AI systems, AI compilers, and hardware accelerators, gaining insight into how neural network computations are implemented on these specialized platforms.
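To give a flavor of the compression techniques listed above, the sketch below illustrates magnitude-based weight pruning and uniform symmetric 8-bit quantization on a random weight matrix. This is a minimal NumPy illustration of the general ideas, not course-provided code; all names and parameter choices (the 90% sparsity target, the int8 range) are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
W = rng.normal(size=(64, 64)).astype(np.float32)  # a dense weight matrix

# Magnitude pruning: zero out the 90% of weights with smallest |w|.
sparsity = 0.9  # illustrative target, not a course-mandated value
threshold = np.quantile(np.abs(W), sparsity)
W_pruned = np.where(np.abs(W) >= threshold, W, 0.0).astype(np.float32)

# Uniform symmetric 8-bit quantization: map floats to int8 via one scale.
scale = np.abs(W).max() / 127.0
W_int8 = np.clip(np.round(W / scale), -127, 127).astype(np.int8)
W_dequant = W_int8.astype(np.float32) * scale  # dequantize to measure error

print(f"sparsity achieved: {(W_pruned == 0).mean():.2%}")
print(f"quantization MSE:  {np.mean((W - W_dequant) ** 2):.6f}")
```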
Lecture Time: Friday 5:00-7:30pm EST (Zoom)
Lecture Location: Jacobs Hall, 6 Metrotech, Room 475
Readings: Course slides and papers
Suggested readings: Goodfellow, Ian, Yoshua Bengio, and Aaron Courville. Deep Learning. MIT Press, 2016. https://www.deeplearningbook.org/
Evaluation Breakdown:
Assignments (30%): three in total, each worth 10%
In-course quiz (15%)
In-course presentation (5%)
Midterm (25%)
Final project (25%)
Proposal (1 page): 5%
Final presentation: 10%
Final report: 10%
Late Submission Policy:
In-course Presentation Instructions
Contact:
When you email, please begin the subject line with one of three category tags to indicate the type of question: [Algorithm], [System], or [Others].
For other inquiries, personal matters, or emergencies, you can email me at sai.zhang@nyu.edu.
Date | Topic | Logistics |
--- | --- | --- |
Jan 23 | Lecture 1: Course Introduction, DNN Basics (Recording) | |
Jan 30 | Lecture 2: Convolutional Neural Networks | |
Feb 6 | Lecture 3: Intro to Transformers and Large Models | |
Feb 13 | Lecture 4: Neural Network Pruning | Assignment 1 out |
Feb 20 | Lecture 5: Neural Network Quantization | |
Feb 27 | Lecture 6: Distillation, Low-Rank Decomposition, and Neural Architecture Search | Assignment 1 due; Assignment 2 out |
Mar 6 | Lecture 7: Efficient Algorithms for Large Models | |
Mar 13 | Lecture 8: Efficient Neural Network Training | Assignment 2 due; Assignment 3 out |
Mar 20 | Spring Break | Project proposal due |
Mar 27 | In-class midterm exam | |
Apr 3 | Lecture 9: Distributed Machine Learning for Training and Inference, ML Compiler | Assignment 3 due |
Apr 10 | Lecture 10: CNN Dataflow & Hardware Accelerators | |
Apr 17 | Lecture 11: Transformer & LLM Accelerators | |
Apr 24 | Lecture 12: Hardware Accelerators for DNN Training | |
May 1 | Lecture 13: New Computation Paradigms | |
May 8 | Final Presentation | |