Skip to main content

How Data Lies

How Learning to Lie with Data is Essential to Prevent AI being Sexist and Racist

Offering practical actionable support to data scientists who are making efforts to be responsible, while recognising why it is hard to do so.

The course can be found  here.

Course Summary

This course title "How learning to lie with data is essential to prevent AI from being sexist and racist." is intended to catch attention but also highlights the content of this course which intends to support data scientists looking to do responsible AI. The first part of the title comes from a book from 1954 titled "How to lie with statistics" which has been brought back into consciousness through another book "Rebooting AI". The first part of this course presents elements of how data can be misleading, while providing concrete tips to identify and address these data issues. The second part of the title refers to a series of recent scandals where it is argued that AI has not been used responsibly. These scandals, some of which are used as case studies in this course, are leading to the legislation coming in to ensure ethical uses of AI. The second half of this course is focussed on these ethical considerations needed for using AI responsibly. The course aims to support Data Scientists and their managers to increase their understanding of potential ethical challenges in the application of AI and provide concrete tips to support them to be responsible.

Who is it for?

This course is designed primarily for Data Scientists who are actively looking to be responsible in their work. Part of it is also intended to be appropriate for managers of data scientists or even their collaborators who may benefit from the broad discussions but skip some of the practical details.

Learning Objectives

By the end of this course, learners will have:

  • an awareness of some ethical considerations which are shaping the future of AI and why data scientists need to be responsible in their role.
  • been exposed to some common pitfalls where data mis-interpretation can arise and be presented with concrete advice to avoid them.

Learners may have:

  • gained practical experience working with data to draw correct conclusions in data containing complexities.

Course Details

  1. Introduction

  2. Data Considerations We have three approaches to consume the content in this section - A Content Approach, A Case Study Approach, A Practical Approach. All three approaches will cover the same case studies and content blocks, which are:

  • 2a- Content
    Module 1 - Definitions Matter
    Module 2 - Data Matters
    Module 3 - Variability Matters
    Module 4 - Interactions Matter
  • 2b- Case Studies
    i) COMPAS Case Study
    ii) Apple and Amazon Case Study
    iii) Ofqual Case Study
    iv) Protein Folding Case Study
  • 2c- Practical Approach
    Interactive example to consume the content using STACK.
  1. Ethical Considerations
  • 3a- Introducing ethics in AI
  • 3b- Fairness and debiasing
  • 3c- AI ethics beyond debiasing
  • 3d- Accreditation
  1. Conclusion

Behind the Course

 This course was developed with The Alan Turing Institute and IDEMS International, in collaboration with partners from AI Ghana, Universitat Bonn, Center for Science and ThoughtZertifizierte KILancaster University, and Caltech

Popular posts from this blog

AI Meetup - The Magic of Generative AI: From Imitation to Innovation

Welcome to our upcoming AI Meet-up where we will delve into "The Magic of Generative AI: From Imitation to Innovation."  In this session, we will embark on a fascinating journey into the world of generative AI, exploring how this cutting-edge technology has transcended the boundaries of imitation and entered the realm of true creativity and innovation.  We will uncover the underlying principles of generative AI, particularly focusing on the transformative power of Generative Adversarial Networks (GANs). Through captivating examples and real-world applications, we will witness how generative AI is reshaping industries and unleashing the potential for novel creations. As we explore the boundless possibilities of generative AI, we will also discuss the ethical considerations that come with this transformative technology. Join us in this engaging meet-up to discover the magic of generative AI and its potential to shape the future of artificial intelligence. Get ready to be inspir

AI Meetup - Data Science Fundamentals

AI Ghana, IoT Network Hub and partners present our next AI Meetup. It takes place on 24th June, 2023 at 1pm GMT. The meetup is on Data Science Fundamentals. This workshop will be facilitated by Emmanuel Agbeli who is a data scientist at Aya Data. It is meant to be a very practical session where participants will get to learn some fundamentals of data science by getting their hands dirty working with data to solve a specific problem.  Book your place by filling the form below (before filling this form, be sure to read our Privacy Policy and our Terms of Service ):  Applications are currently closed. Find your way to the venue with Google Maps

My Journey through the AI GHANA Mentorship Program

  I was browsing LinkedIn one day when I saw a post by AI Ghana looking for individuals to mentor in data science, machine learning, and artificial intelligence. I tried my luck by applying, and in a few days I was invited for an interview to learn more about me. After the interview, in a few weeks I was accepted into the mentorship program out of a large number of applicants.  To add to that, I was very happy that day that I received an email that I had been accepted into the mentorship program. Since I am an artificial intelligence (AI) enthusiast and I didn't know where to start, this was a great opportunity for me to start my journey in the field. Besides, I am a geomatic engineering student, and I want to be a geospatial developer or geospatial data analyst in the future. Being in this mentorship is a step torward becoming who I want to be. Finally, my mentor, Godliver Alangyam, introduced me to the Python programming language, in which she taught me how to use some data scien