Over the last few months I worked with many leaders in the open-source data space to put together a community day on Oct. 21 at IBM TechXchange 2024, IBM's developer conference. I'm beyond thrilled with the agenda and even more excited to get so many of the OS data community together in Las Vegas! 🙌
If you want to get started or dive into some of the most popular projects in open-source data, this will be a great event to attend. The lineup (it's long!) includes:
👉 Apache Arrow, the columnar memory format software framework, from Matt Topol, Voltron Data
👉 Apache #Gluten, a plugin for Spark that improves its performance, from creator binwei yang
👉 #Velox, a C++ database acceleration library, from Pedro Pedreira, Meta
👉 Apache Polaris (Incubating), a REST catalog for Apache #Iceberg, from Eric Maynard, Snowflake
👉 Apache Hudi, a transactional data lakehouse platform, from Sivabalan Narayanan, Onehouse
👉 #Presto, a SQL query engine for data analytics, from Tim Meehan, IBM
👉 Apache #Ranger, a framework that manages platform security, from Don Bosco Durai, Privacera
👉 #Ibis, a #Python dataframe library, from Gilbert Forsyth, Voltron Data
👉 #Presto C++, the next-gen version of Presto built on #Velox, from Aditi Pandit, IBM
👉 Apache #Pinot, a real-time distributed OLAP datastore, from Barkha Herman, StarTree
👉 Apache Airflow, a tool to manage data workflows, from Kenten Danas, Cody Rich, Jacob Roach at Astronomer
👉 Apache #Superset, a BI data visualization tool, from Evan Rusackas and Beto Dealmeida, Preset
👉 How Meta uses #Presto at scale from Amit Dutta, Meta
Details and registration:
https://lnkd.in/eKbTS66C
Finally, since you made it this far 😉 I have some complimentary conference passes if you'd like to participate in any of these open-source projects! Drop me a DM.
See you there 😎
Ali
#watsonx.ai
#GenerativeAI