Ask a question
Join us for IBM TechXchange Day: AI and Automation
Description:Most areas of Python data science have standardized Pandas DataFrames for representing and manipulating structured data in memory. Natural Language Processing, though, not so much. In this presentation, we’ll explain why you should be using Pandas for NLP. Pandas DataFrames make every phase of NLP easier, from creating new models to evaluating their effectiveness to building applications that integrate those models. We’ll talk about our open source library, Text Extensions for Pandas, which adds special data types and library integrations specifically geared to NLP use cases. We’ll also explain how these extensions connect to some basic NLP concepts, and then we’ll finish with an example of using Pandas to build an NLP application.
Speaker bio:Fred Reiss is a Principal Research Staff Member at IBM Research and Chief Architect at IBM’s Center for Open-Source Data and AI Technologies (CODAIT). He is also one of the authors of the Text Extensions for Pandas library. Fred received his Ph.D. from U.C. Berkeley in 2006 and immediately IBM Research, joining the CODAIT center in 2015. Fred has written multiple peer-reviewed papers in the areas of natural language processing, database systems, and machine learning.