Data Science and Big Data Analytics (DSBDA)

This repository serves as a comprehensive resource for the Data Science and Big Data Analytics course, featuring notes, code samples, handouts, and previous year question papers. It supports course outcomes such as analyzing challenges in data science, applying statistics, implementing big data analytics with Python, and utilizing visualization tools. Additionally, it covers the design of big databases using the Hadoop ecosystem, ensuring a thorough understanding of the technologies and strategies essential for effective data analytics.


Tip

Want to contribute? Start by opening an issue in this repository!

Index

Notes

  1. Unit 1 - Introduction to Data Science and Big Data
  2. Unit 2 - Statistical Inference
  3. Unit 3 - Big Data Analytics Life Cycle
  4. Unit 4 - Predictive Big Data Analytics with Python
  5. Unit 5 - Big Data Analytics and Model Evaluation
  6. Unit 6 - Data Visualization and Hadoop

Codes

  1. Code-A1 (Data Wrangling-1)
  2. Code-A2 (Data Wrangling-2)
  3. Code-A3 (Descriptive Statistics)
  4. Code-A4 (Data Analytics-1)
  5. Code-A5 (Data Analytics-2)
  6. Code-A6 (Data Analytics-3)
  7. Code-A7 (Text Analytics)
  8. Code-A8 (Data Visualization-1)
  9. Code-A9 (Data Visualisation-2)
  10. Code-A10 (Data Visualisation-3)
  11. Code-B1 (Hadoop Word Count)
  12. Code-B4 (Apache Scala)

Notebooks

  1. Notebook-A1 (Data Wrangling-1)
  2. Notebook-A2 (Data Wrangling-2)
  3. Notebook-A3 (Descriptive Statistics)
  4. Notebook-A4 (Data Analytics-1)
  5. Notebook-A5 (Data Analytics-2)
  6. Notebook-A6 (Data Analytics-3)
  7. Notebook-A7 (Text Analytics)
  8. Notebook-A8 (Data Visualization-1)
  9. Notebook-A9 (Data Visualization-2)
  10. Notebook-A10 (Data Visualization-3)

Practical

Each folder contains handout, write-up and softcopy (i.e. code + output).

  1. Assignment-A1
  2. Assignment-A2
  3. Assignment-A3
  4. Assignment-A4
  5. Assignment-A5
  6. Assignment-A6
  7. Assignment-A7
  8. Assignment-A8
  9. Assignment-A9
  10. Assignment-A10
  11. Assignment-B1
  12. Assignment-B2
  13. Assignment-B4

Question Papers

END-SEM PYQ Answers


Miscellaneous

-> Disclaimer: Please read the DISCLAIMER file for important information regarding the contents of this repository.

-> Note: Content such as codes, softcopies, write-ups and question papers is provided by us, i.e. our contributors. You are free to use this content however you wish, without any restrictions. Some of the notes and handouts have been provided by our professors, thus to use them for anything other than educational purposes, please contact them.

-> Maintained by:

-> Contributors:

  • Afan Shaikh
  • Ayush Kalaskar
  • Himanshu Patil
  • Shriniwas G
  • Vedant Jamodkar

-> Repository icon from Icons8.

-> Motto:

Making information freely accessible to everyone.

-> Keywords:

SPPU, Savitribai Phule Pune University, Pune University, Computer Engineering, COMP, Third Year, TE, Semester 6, SEM-6, Data Science and Big Data Analytics, DSBDA, DSBDA codes, DSBDA notes, SPPU DSBDA notes, DSBDA handouts, DSBDA softopy, SPPU DSBDA code and output, DSBDA question papers, DSBDA PYQs


S
Description
This repository serves as a comprehensive resource for the Data Science and Big Data Analytics course, featuring notes, code samples, handouts, and previous year question papers. It supports course outcomes such as analyzing challenges in data science, applying statistics, implementing big data analytics with Python, and utilizing visualization tools. Additionally, it covers the design of big databases using the Hadoop ecosystem, ensuring a thorough understanding of the technologies and strategies essential for effective data analytics.
Readme 56 MiB
Languages
Jupyter Notebook 99.7%
Java 0.3%