Part 2: How do I Start Conducting Text Analysis: Data Preprocessing

In this second Text Analysis Virtual Workshop series, we will cover data preprocessing, which is a set of steps and techniques applied to raw text data before analysis. These steps include tokenization, lowercasing, stop word removal, stemming and lemmatization, removing special characters, and further text cleaning. The goal of data preprocessing is to transform and prepare the text data for further analysis, ensuring that the data is accurate, consistent, and suitable for extracting meaningful insights. Join us to learn and practice how to clean, tokenize, remove stop words, and perform stemming or lemmatization on text data to prepare the data corpus for text analysis.

-

    

      

Ticketing

Event date

Event website

Hosting/Sponsoring organization/Unit

Unit for Data Science and Analytics Unit for Data Science and Analytics

Event for

Current students Alumni Faculty and staff

Event location

Online

Building/Room/Location

Zoom

Email for event inquiries

datascience@asu.edu

Map

RSVP

Audience

Event Category

Name for event inquiries

Kerri Rittschof

Phone number for event inquiries