

Data science for all – an open source approach to education

By Ana Echeverri posted Thu September 19, 2019 07:13 PM


Today is an exciting day for me. After months of hard work, IBM, the University of Pennsylvania, and the Linux Foundation are announcing an innovative, first-of-a-kind open source project that will enable universities around the world to build Data Science programs faster.

With IBM’s investment and industry expertise, the University of Pennsylvania’s long-standing academic leadership, and the Linux Foundation as a premier open source consortium, we are creating a curriculum kit composed of open source building blocks for teaching the core concepts of data science in undergraduate and graduate programs. These building blocks are based on Python and open source tools and frameworks, and include slides, documentation, code, and data sets that anyone can adopt or update.

This idea of open source Data Science education is personal to me. Access to education changed my life. Coming from a small town in Colombia, South America, education gave me the opportunity to work with cutting-edge Data Science and AI technologies at one of the best companies in the world (IBM). I believe this project will provide a foundation of building blocks for schools to supplement, strengthen, and start up their data science programs. Most importantly, because it is open source, it is available to any institution on earth, giving more learners the opportunity to participate in the AI Economy as I did.

When I first started this project, I met with universities in different regions of the world, and a common theme emerged: starting a Data Science program from scratch is incredibly difficult, and universities need educational materials to accelerate their efforts. This was both encouraging and a validation of the need: the demand is worldwide, and open source education can reach across oceans as well as into our local community colleges.

By making a “starter set” of training materials available and providing guidance on how to build a Data Science program, IBM, cross-industry partners, and educators can work together to accelerate the availability of skills-building programs around the world.

It is the beginning of a new era for Data Science Education. 

The project is currently in incubation while IBM and UPenn create the initial set of materials to contribute. It will officially launch in early 2020. To get early insights and stay up to date with the project, please register here.




Tue February 11, 2020 05:03 PM

Great! Thank you for sharing this info with us

Fri February 07, 2020 01:41 PM

Great innovation with superb foresight. Thanks for the contribution to humanity.


Wed February 05, 2020 06:48 PM

Great Ana,

we are all eagerly waiting for it!!!


Fri November 08, 2019 12:23 PM

The project is still in incubation under the governance of the Linux Foundation. The curriculum kit will be hosted on GitHub once the project comes out of incubation in early 2020.

Fri November 08, 2019 12:07 PM

Will it be available for us at the IBM Data Science Community here, a bit in advance? Is there already a course offered, based on this new open source framework?

Mon October 07, 2019 04:10 AM

... the approach and look forward to hearing from you.

Wed September 25, 2019 03:53 PM

Wonderful idea, thank you for your great effort. I signed up for the mailing list and hope to contribute to and adopt the material.