ECE-GY 9483/CSCI-GA 3033

Special Topics in Electrical Engineering

EFFICIENT AI AND HARDWARE ACCELERATOR DESIGN

Spring 2025

 

The course will focus on recent advancements in the design of efficient neural networks, specifically on how to create and optimize AI models for improved performance, scalability, and resource efficiency. Students will explore key techniques like model compression, pruning, quantization, and model distillation for CNN, RNN, Transformer and LLM, aimed at reducing computational complexity and memory usage while maintaining accuracy. Additionally, the course will cover efficient training and inference methods, including distributed computing, parallelism, and low-precision computation, which are crucial for deploying AI on resource-limited devices such as smartphones or edge computing systems. Students will also study advanced hardware architectures of AI accelerators, gaining insights into the hardware implementation of neural network computations on these specialized systems.

 

Lecture Time: Wednesday 5:00-7:30pm EST (Zoom)

Lecture Location: Room 473, Jacobs Academic Bldg

Readings: Course slides and papers

Suggested readings: Goodfellow, Ian. "Deep learning." (2016). https://www.deeplearningbook.org/

Evaluation Breakdown:

  • Participation (5%)

  • Assignments (30%): total three of them, each counts 10%

  • In-course presentation (10%) 

  • Midterm (25%) 

  • Final project (30%)

    • Proposal (1 page) 5%

    • Final presentation 15%

    • Final report 10%

Late Submission Policy: 

  • A 10% deduction will be applied to the original grade if submitted within 24 hours; otherwise, a 30% deduction will be applied.

Syllabus 

 

In-class Presentation Format

Presentation Assignment

 

Contact:

  • For questions related to course materials, use efficientaiaccelerator@gmail.com 
    • When you send your email, please start the email title with three categories to indicate the type of the question: [Algorithm], [Hardware], [Others].
  • For other inquiries, personal matters, or emergencies, you can email me at sai.zhang@nyu.edu

Instructor
Sai Zhang

Office Hour: Friday 1:30pm-2:30pm

Zoom link

Course Assistants
Zhenyuan Dong

Office Hour: Wednesday 8:30am-9:30am

Zoom link

Aditya Ojha

Office Hour: Thursday 11:00am-12:00pm

Zoom link

Grader
Khushi Sharma

Course Schedule

Date
Topic
Logistics

Jan 22

Lecture 0: Course Introduction

Lecture 1: Basic topics of DNN

 

Jan 29

Lecture 2: Convolutional Neural Networks & RNN (Recording)

 

Feb 05

Lecture 3: Transformer and its Application in AIGC (Recording)

 

Feb 12

Lecture 4: Pruning Strategies for Efficient DNN Implementation (Recording)

Assignment 1 is posted on Brightspace

Feb 19

Lecture 5: DNN Quantization (Recording)

 

Feb 26

Lecture 6: Distillation, Low Rank Decomposition and Neural Architecture Search 

Assignment 1 due

Assignment 2 out

Mar 05

Lecture 7: Efficient DNN Training

 

Mar 12

Lecture 8: Machine Learning System for Training and Inference

Assignment 2 due

Assignment 3 out

Mar 19

Lecture 9: CNN Dataflow & Hardware Accelerators

Project proposal due

Mar 26

Spring Break

 

Apr 02

In-class midterm exam

Assignment 3 due

Apr 09

Lecture 10: Transformer & LLM Accelerators

 

Apr 16

Lecture 11: Hardware Accelerator for DNN Training

 

Apr 23

Lecture 12: New Computation Paradigms

Invited Talk

Apr 30

Lecture 13: New playground for Efficient AI: AR/VR

Invited Talk

May 14

Final Presentation