Python for Data Science

Course Outline Download
Introduction Download
DayModulesSharing
1
  • Introduction to Python and Data Science
  • Installing and setting up the Python environment
  • Basic Python syntax and data types
  • NumPy, sets, list, and tuples data storage
  • Control structures in Python
  • Creating and using functions, date and time functions
  • Laboratory exercise – My first Python program
Useful References:
NumPy
2
  • Comprehension of the use of lists and indexing data with dictionaries
  • Sub-setting (Lists, Matching, Handling Missing Values)
  • Reading and writing real-world data (text file, CSV and SQL)
  • Introduction to pandas and managing Data Frames
  • Laboratory exercise – Working with Python pandas data frame
Useful References:
Pandas
3
  • Shaping data for analysis
  • Handling raw text, using bag of words, TF-IDF and DTM shaping
  • Sub-setting observations & variables, summarizing data
  • Laboratory exercise – Getting, format (sub-setting) and store data.
  • Feature creation, combining variables
  • Understanding binning, discretization and data distribution
  • Laboratory exercise – Preparing extracted data analysis
4
  • Introduction to MatPlotLib
  • Exploratory graphs in Python
  • Visualizing data
  • Laboratory exercise – Visualizing data
  • Wrangling data and reducing dimensionality (feature selection)
  • Basic clustering of data and handling outliers
  • Laboratory exercise – Presenting analytics using Python
Useful References:
Matplotlib
SQL
Others
5
  • Regression
  • Association Rule and Frequent Itemset Mining
  • Knn, Decision Trees and Random Forest
  • Support Vector Machines
  • Text processing using NLTK
  • Laboratory exercise – Text classifier
  • Wrap-up
  • Day5.zip
  • Briefying on AI
  • Useful References:
    NLP Tutorials: Useful Links:
    ML Tutorials:
    Web Scraping

    Recommanded Books
    TitleSerious Python: Black-Belt Advice on Deployment, Scalability, Testing, and More
    ISBN978-1-593-27878-6
    AuthorJulien Danjou
    Year2019
    Publisher No Starch Press
    TitlePython re: a magical tool for text processing
    ISBN
    AuthorSundeep Agarwal
    Year2019
    Publisher Sundeep Agarwal
    TitleNumpy Programming, For Beginners, Quick Start Guide: Numpy Language Crash Course Tutorial & Exercises
    ISBNB09446DZQH
    AuthorYao, Ray
    Year2021
    Publisher In Easy Step By Step, Teach Yourself eBook & Book
    TitlePython Data Analytics: Mastering Python for Effective Data Analysis and Visualization
    ISBNB0CW28VQJ4
    AuthorFloyd Bax
    Year2024
    Publisher FLOYD BAX; 1st edition
    TitlePandas for Everyone: Python Data Analysis
    ISBN978-0-137-89115-3
    AuthorDaniel Chen
    Year2023
    Publisher Addison-Wesley Professional
    TitleData Analysis with Python: Introducing NumPy, Pandas, Matplotlib, and Essential Elements of Python Programming
    ISBN978-9-355-51065-5
    AuthorRituraj Dixit
    Year2023
    Publisher BPB Publications
    TitlePython for Data Analysis: Data Wrangling with pandas, NumPy, and Jupyter
    ISBN978-1-098-10403-0
    AuthorWes McKinney
    Year2022
    Publisher O'Reilly Media
    TitlePandas 1.x Cookbook: Practical recipes for scientific computing, time series analysis, and exploratory data analysis using Python
    ISBN978-1-839-21310-6
    AuthorMatt Harrison, Theodore Petrou
    Year2020
    Publisher Packt Publishing
    TitleScientific Computing with Python: High-performance scientific computing with NumPy, SciPy, and pandas
    ISBN978-1-838-82510-2
    AuthorClaus Fuhrer; Jan Erik Solem; Olivier Verdier
    Year2021
    Publisher Packt Publishing Ltd
    TitleNumerical Python: Scientific Computing and Data Science Applications with Numpy, SciPy and Matplotlib
    ISBN978-1-484-24245-2
    AuthorRobert Johansson
    Year2019
    Publisher Apress
    TitlePython for Data Analysis: A Basic Programming Crash Course to Learn Python Data Science Essential Tools, Pandas, and Numpy with Questions and Answer from Beginners to Advanced
    ISBNB07Z6H5Y7H
    AuthorOscar Scratch
    Year2019
    Publisher
    TitlePython Data Analytics: With Pandas, NumPy, and Matplotlib
    ISBN978-1-484-23912-4
    AuthorFabio Nelli
    Year2018
    Publisher Apress
    TitleQuerying MySQL
    ISBN978-9-355-51267-3
    AuthorAspin, Adam
    Year2022
    Publisher BPB Publications
    TitleHands-On Python Natural Language Processing: Explore tools and techniques to analyze and process text with a view to building real-world NLP applications
    ISBN978-1-838-98258-4
    AuthorAman Kedia; Mayank Rasu
    Year2020
    Publisher Packt Publishing Ltd
    TitleLearn AI with Python: Explore Machine Learning and Deep Learning techniques for Building Smart AI Systems Using Scikit-Learn, NLTK, NeuroLab, and Keras (English Edition)
    ISBN978-9-391-39261-1
    AuthorGaurav Leekha
    Year2021
    Publisher BPB Publications
    TitleHands-On Web Scraping with Python: Extract quality data from the web using effective Python techniques
    ISBN978-1-837-63621-1
    AuthorAnish Chapagain
    Year2023
    Publisher Packt Publishing
    TitleNatural Language Processing: Python and NLTK
    ISBN978-1-787-28510-1
    AuthorNitin Hardeniya, Jacob Perkins, Deepti Chopra, Nisheeth Joshi, Iti Mathur
    Year2016
    Publisher Packt Publishing