Data Wrangling & Visualisation for Data Scientists

Awantik Das edited this page Aug 2, 2019 · 2 revisions

Introduction to Data Wrangling

  1. The journey of a Data Scientist
  2. Why Python has become the default choice
  3. Tools vs Libraries
  4. Popular libraries
  5. Objectives of Data Wrangling
  6. Challenges of the Big Data Ecosystem

Gathering Raw Data

Revision: The concept of Python iterators

  1. SQL Databases
  2. NoSQL Databases
  3. Parsing XML - Simple & Semi-structured
  4. CSV Parser
  5. From REST Interface
  6. Parsing HTML
  7. JSON parsing
  8. Connecting with Twitter
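
As a quick refresher on the iterator revision noted above, here is a minimal sketch of the iterator protocol applied to one of the listed sources (CSV). The class `RowReader`, the function `read_rows`, and the sample data are hypothetical names and values made up for illustration; they are not taken from the course material.

```python
import csv
import io


class RowReader:
    """Hypothetical iterator that yields one parsed CSV row at a time."""

    def __init__(self, text):
        # csv.reader is itself an iterator over the rows of a file-like object.
        self._reader = csv.reader(io.StringIO(text))

    def __iter__(self):
        # An iterator returns itself from __iter__.
        return self

    def __next__(self):
        # next() raises StopIteration once the underlying reader is exhausted,
        # which is what ends a for-loop over this object.
        return next(self._reader)


def read_rows(text):
    """Generator version of the same idea: rows are produced lazily."""
    for row in csv.reader(io.StringIO(text)):
        yield row


# Made-up sample data, for illustration only.
sample = "name,score\nasha,91\nravi,78\n"

rows = list(RowReader(sample))
generated = list(read_rows(sample))
```

Both forms hand back rows one at a time instead of loading everything at once, which is the property that matters when the same pattern is applied to database cursors, REST responses, or large files.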

Fundamentals of NumPy & Pandas

Data Cleaning

  • 5 Projects

Shaping & Structuring

Manipulation of Text Data

Storing cleaned data for Data Analysis
