YoVDO

Responsible & Safe AI Systems

Offered By: NPTEL via Swayam

Tags

Computer Science Courses Artificial Intelligence Courses Machine Learning Courses Deep Learning Courses Privacy Courses Fairness Courses Interpretability Courses Adversarial Attacks Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
ABOUT THE COURSE: There has been an exponential increase in the use of platforms / technologies like ChatGPT, Gemini, Llama, Sora, DALL-E, etc. in our day-to-day lives. There Language & Vision models have changed our way of living, and way we seek & create information. This course provides students with a comprehensive understanding of the ethical, social, and safety considerations essential for developing and deploying artificial intelligence (AI) systems. Uncover the intricacies of algorithmic transparency, fairness in machine (un)learning, interpretability, consistency and many more. The course encourages critical thinking and fosters a deep appreciation for the impact of AI on individuals and communities. Students who complete the course can: recognize possible harms that can be caused by modern AI capabilities; learn to reason about various perspectives on the trajectory of AI development and proliferation; learn about latest research agendas towards making AI systems safer.INTENDED AUDIENCE: Anybody interested in the area of AI, Machine Learning, including industry professionals and studentsPREREQUISITES: Any level of machine learning / AI course would help, it is not mandatory thoughINDUSTRY SUPPORT: TCS, Wipro, Microsoft, Infosys, Amazon, Uber, to name a few, any company involved in AI & ML will be interested

Syllabus

Week 1 &2:
  • AI Capabilities Improvement in last 5-10 years
  • Imminent risks from AI Models: Toxicity, bias, goal misspecification, adversarial examples etc.
  • Long-term risks from AI Models: Misuse, Misgeneralization, Rogue AGI
  • Principles of RAI - Transparency; Accountability; Safety, Robustness and Reliability;Privacy and Security; Fairness and non-discrimination; Human-Centred Values; Inclusive
    and Sustainable development, Interpretability
  • Recap of Deep Learning Techniques, Language/Vision Models
  • AI Risks for Gen models
  • Adversarial Attacks – Vision, NLP, Superhuman Go agents
Week 3 &4:
  • ML Poisoning Attacks like Trojans
  • Implications for current and future AI safety
  • Explainability
  • Imminent and Long-term potential for transparency techniques
  • Mechanistic Interpretability
  • Representation Engineering, model editing and probing
  • Critiques of Transparency for AI Safety
Week 5 &6:
  • Privacy & Fairness in AI
Week 7 &8:
  • Metrics and Tools for RAI - measuring bias/fairness, adversarial testing, explanations (Lime/SHAP/GradCam), audit mechanisms
  • Regulation landscape - DPDP act (India), GDPR (EU), EU AI act, US presidential declaration, Ethical approvals, informed consent, participatory design, future of work, Indian context
  • What is AGI? When could it be achieved?
  • Instrumental Convergence: Power Seeking, Deception etc.
Week 9 &10:
  • RAI in Legal domain
  • RAI in Health care domain
  • RAI in Education domain
  • A few other domains
  • Policy issues in RAI
Week 11&12:
  • Couple of panel discussion with industry practitioners, academic, government (possibly), and others.
  • Fireside chat with eminent personalities
  • Recorded Paper reading discussion

Taught by

Prof. Ponnurangam Kumaraguru, Prof. Balaraman Ravindran, Prof. Arun Rajkumar

Related Courses

AWS Engenheiro de ML AWS Associate 2.1: Como definir uma estratégia de modelagem (Português) | AWS ML Engineer Associate 2.1 Choose a Modeling Approach (Portuguese)
Amazon Web Services via AWS Skill Builder
AWS ML Engineer Associate 2.1 Choose a Modeling Approach (Korean)
Amazon Web Services via AWS Skill Builder
AWS ML Engineer Associate 2.1 Choose a Modeling Approach (Simplified Chinese)
Amazon Web Services via AWS Skill Builder
Responsible AI for Developers: Interpretability & Transparency - Polski
Google Cloud via Coursera
Responsible AI for Developers: Interpretability & Transparency - 日本語版
Google Cloud via Coursera