commit e775edbb639661a10f28f3b0727e0229232314bd Author: Kshiitij Date: Wed Jun 11 13:43:41 2025 +0530 Pruned old main branch & commits. Created a new one. diff --git a/README.md b/README.md new file mode 100644 index 0000000..6cb1104 --- /dev/null +++ b/README.md @@ -0,0 +1,3 @@ +# Data Science and Big Data Analytics (DSBDA) + +--- diff --git a/git-commit-logs.txt b/git-commit-logs.txt new file mode 100644 index 0000000..f3dc8fc --- /dev/null +++ b/git-commit-logs.txt @@ -0,0 +1,549 @@ +commit 89cad5bd9f5b74dfb9f050facd692a33296471f3 +Author: Kshitij +Date: Mon Jun 9 22:14:16 2025 +0530 + + Added a list of contributors in readme. + +commit 490af18fba1888079904713c9c758c6a61c1b37e +Author: Kshitij +Date: Mon Jun 9 22:10:35 2025 +0530 + + Added solved may june 2024 end-sem question paper. Provided by Vedant Jambo. + +commit 192ee91172e03e9ae3d4459321dfda593c62c5fa +Author: Kshitij +Date: Mon Jun 9 19:21:23 2025 +0530 + + Added description in README. + +commit b459b8ae0644affabdc211c4de8dfbafed5a726a +Author: Kshitij +Date: Mon Jun 9 19:14:36 2025 +0530 + + Scala code has been tested, added da badge. + +commit 3068cca2207ddafcd2b20343f42a659b4a72f52c +Author: Kshitij +Date: Mon Jun 9 19:08:58 2025 +0530 + + Added additional Scala codes and referenced them in main scala code markdown file. + +commit c93082840d1ce30a61a5e91c2d3bba699f07a4dd +Author: Kshitij +Date: Mon Jun 9 19:02:19 2025 +0530 + + Added notes for some topics of unit 5 and unit 6, written by yours truly. + +commit 73dd14599eef08059f5bc9f0191341db99b3cb91 +Author: Kshitij +Date: Mon Jun 9 17:51:36 2025 +0530 + + Added link to end-sem pyq answers in readme. + +commit 41c609ba15ed1907810cd015b84e826c702a42f9 +Author: Kshitij +Date: Mon Jun 9 17:49:56 2025 +0530 + + Added handwritten notes, written by Shriniwas G. + +commit 701f2af301cbc5ee3a5dd12afb74fe37be224de3 +Author: Kshitij +Date: Mon Jun 9 17:19:16 2025 +0530 + + Added answers for end-sem pyqs. Assembled by Himanshu Patil. + +commit b462cb3acfe002f4af5356da94308e2f89456466 +Author: Kshitij +Date: Sat May 24 22:54:49 2025 +0530 + + Added solved pyq numericals for dsbda. + +commit 7b80f55d3b263f0e1211c4589ba993613a892dd9 +Author: Kshitij +Date: Sat May 17 20:58:26 2025 +0530 + + Added nov-dec 2024 pyq, provided by Afan. + +commit 06507b39c8c40271e3a4f121a6e378effb1ad395 +Author: Kshitij +Date: Sun May 11 21:57:42 2025 +0530 + + Added ma'am ppts for unit 5 and 6, provided to us by Shriniwas. + +commit 19e377798bceb61eb9073e55a0a0c0169e9a4c65 +Author: Kshitij +Date: Mon Apr 28 23:30:27 2025 +0530 + + Added POS tagging in codes and notebook for A7. + +commit fce74c76f94804c2f71956bef8d91823534047a0 +Author: K +Date: Mon Apr 28 09:48:32 2025 +0530 + + fixed 20gb, its 2gb + +commit ab26225202501913692c2ce27d2a8ce9e30af342 +Author: Kshitij +Date: Sun Apr 27 23:19:56 2025 +0530 + + Final changes to steps file for now. + +commit d35c267987da2a96507086570f0ce4722fa1a8e8 +Author: Kshitij +Date: Sun Apr 27 23:09:42 2025 +0530 + + Added JAR file for wordcount. + +commit cbc448462e644e29ba34d1153585efb9fd56d386 +Author: Kshitij +Date: Sun Apr 27 23:08:55 2025 +0530 + + Updated steps for Hadoop wala code but still haven't tested it tho + +commit ff604e972b43bc3a95f5061f549f09c77e489c0a +Author: Kshitij +Date: Sun Apr 27 22:50:16 2025 +0530 + + Added a simplified method to add data in code A2. + +commit 49a7dec21788b3ad4b8fca30565a9d3269b29f45 +Author: Kshitij +Date: Sun Apr 27 21:30:09 2025 +0530 + + Added code and execution steps for Apache Scala. + +commit 30b722f1542adda38075d2d0eb36e075a12f2b22 +Author: Kshitij +Date: Sun Apr 27 18:36:09 2025 +0530 + + Fixed link for B1 in readme. + +commit 421c9f8ac54ec9357bfeaf66d84fee7f4fb1235d +Author: Kshitij +Date: Sun Apr 27 18:35:10 2025 +0530 + + Added working code and basic steps for Hadoop code, B1. + +commit b57eee1398f3b62d4db1a22d1acc1bdd16db69a0 +Author: Kshitij +Date: Sun Apr 27 16:39:41 2025 +0530 + + Added Term Frequency and Inverse Document Frequency in code A7. + +commit 17f31ed690f73f7c51e1f1311b97df8d41e586d0 +Author: Kshitij +Date: Sun Apr 27 16:24:12 2025 +0530 + + Removed unnecessary # from code A8 + +commit 251ee7148eb21f478baac922aa68c21dfdf28a2a +Author: Kshitij +Date: Sun Apr 27 16:16:57 2025 +0530 + + Made minor changes to simply the code for A7 for better understanding. + +commit f10e2c4cf458cb9f7f69f22aa57b91d03457f3e6 +Author: Kshitij +Date: Sun Apr 27 15:54:25 2025 +0530 + + Changed stop words to punctuation in print statement since that's what's being removed! + +commit 8e13d1febec30a1fbee3dbbbb2599cabbe5923ec +Author: Kshitij +Date: Sun Apr 27 15:47:46 2025 +0530 + + Added nltk in download to fix downloading resources. + +commit 0e95f3d516e0c39ff34100a5ddea4afe572208b6 +Author: Kshitij +Date: Sun Apr 27 00:19:50 2025 +0530 + + Changed from Bri'ish Englishshhhhhhhhhhhhhhhhhh for A9 & A10 imma sleeppppp nowwwwwwwwwwwwwwwwwwwwwwwwwwwwww + +commit 5463200996bd98764b41afe89c19fc8c2d54042c +Author: Kshitij +Date: Sun Apr 27 00:18:31 2025 +0530 + + Added code and notebook for assignment A10 as per ma'am ki Jupyter notebook. + +commit dd2147c6824c055a92ed0be0579e4830c5af6ee3 +Author: Kshitij +Date: Sat Apr 26 23:50:03 2025 +0530 + + Updated readme, added links to all codes and notebooks. + +commit bc5fe2861563bfadcbc80926f79fa4e126a97ae9 +Author: Kshitij +Date: Sat Apr 26 23:46:02 2025 +0530 + + Updated code and notebook for assignment A9 as per ma'am ki Jupyter notebook. + +commit 55bbc8b0951c2921f3536fd0633f18fa949bc32c +Author: Kshitij +Date: Sat Apr 26 23:35:45 2025 +0530 + + Added tab key tip in code A4 & A5 since there's too many packages to import. + +commit 80e99aa32ae7e7cb3de3d9500b279c3becb0ded3 +Author: Kshitij +Date: Sat Apr 26 23:26:56 2025 +0530 + + Added code and notebook for assignment A8 as per ma'am ki Jupyter notebook. + +commit 2d08f21829803b5f4e7ca05bce23cac55fe0097d +Author: Kshitij +Date: Sat Apr 26 23:03:14 2025 +0530 + + Updated code and notebook for assignment A7 as per ma'am ki Jupyter notebook. + +commit 17e109d8a10e2ba6bd230bfd90014cdcf78564a8 +Author: Kshitij +Date: Sat Apr 26 22:33:43 2025 +0530 + + Updated code and notebook for assignment A6 as per ma'am ki Jupyter notebook. Dataset (iris.csv) already present in 'Datasets' dir. + +commit 4aadf165d1b306e21b2ba164aec164e16e323214 +Author: Kshitij +Date: Sat Apr 26 22:24:29 2025 +0530 + + Changed shell to python3 and python3 to shell in markdown code blocks wherever necessary. + +commit d3a436abe684161fee544620016c0b0e0f919adc +Author: Kshitij +Date: Sat Apr 26 01:24:00 2025 +0530 + + Fixed name for assignment A4 notebook. + +commit ae7baee5d30d3ba80c05fa274dff44c67bc00b3a +Author: Kshitij +Date: Sat Apr 26 01:22:12 2025 +0530 + + updated readme bruhhhhhhhhhhhh im exhausted + +commit 583c0e411d69a5dd530d8062903472c28b423715 +Author: Kshitij +Date: Sat Apr 26 01:16:21 2025 +0530 + + Updated code and notebook for assignment A5 as per ma'am ki notebook. + +commit 5dba6ec4ee4eeea7266fd785106eb0114929972e +Author: Kshitij +Date: Sat Apr 26 00:31:48 2025 +0530 + + Updated code and notebook for assignment A2 as per ma'am ki notebook. + +commit db378bccdad684a0f82f814901a882078c616503 +Author: Kshitij +Date: Sat Apr 26 00:05:22 2025 +0530 + + Updated code and notebook for assignment A1 as per ma'am ki notebook. + +commit 96005720d9bf7fb47daf089073e059a7ef298bbc +Author: Kshitij +Date: Fri Apr 25 23:39:16 2025 +0530 + + Added code, dataset and notebook for assignment A3. + +commit 177e062e1ace5643abd1b3bb483c96ab74880dbe +Author: Kshitij +Date: Fri Apr 25 22:55:20 2025 +0530 + + Updated dataset name in dataset dir and in respective codes. + +commit 4e3f7f08c4d14cebee91750ec83062476554ac82 +Author: Kshitij +Date: Fri Apr 25 22:52:57 2025 +0530 + + Added code, dataset and notebook for assignment A4. + +commit cf5eefa9c3fa526e8946dc77678e85412c10ae00 +Author: Kshitij +Date: Wed Apr 23 23:54:58 2025 +0530 + + Made minor modifications to Hadoop setup instructions file. + +commit d653dfff67840753ae5a14b6c0d17b7a63a68939 +Author: Kshitij +Date: Wed Apr 23 09:40:33 2025 +0530 + + Added code, dataset and notebook for assignment A2 (data wrangling 2). + +commit d1a590888e1e53552629c0c2139ffd82cf069bc8 +Author: Kshitij +Date: Wed Apr 23 01:12:12 2025 +0530 + + Added hadoop setup instructions, need to test em today during mock practical tho πŸ˜Άβ€ + +commit 2a0f007e80bd8b5b370f0984d396c608a1e49db1 +Author: Kshitij +Date: Wed Apr 23 00:31:47 2025 +0530 + + Added code, dataset and notebook for assignment A1. + +commit 6dcfa570c753a5a95e093dadb048dc53dfbd3327 +Author: Kshitij +Date: Thu Apr 10 23:08:43 2025 +0530 + + Added write-up for assignment B4. + +commit 8ad6629b7175d3f273c1fc5945e66f194c465deb +Author: Kshitij +Date: Tue Apr 8 11:51:03 2025 +0530 + + Added write-up for assignment B2. + +commit 1db9288cd7eecc56f871cb93c32f9fc7d058dbc9 +Author: Kshitij +Date: Sun Apr 6 12:53:58 2025 +0530 + + Added code for assignment A7 (text analytics) and linked in README. + +commit 9758d9f7115aef1891a47ffd22e50304d46099a9 +Author: Kshitij +Date: Sun Apr 6 11:45:22 2025 +0530 + + Added handwritten notes for unit 1. + +commit 409d77d5b71a8299d161900e473b2e337ed84cc2 +Author: Kshitij +Date: Sat Apr 5 00:58:00 2025 +0530 + + Added write-up for assignment A9. + +commit 117f38a970d10b083d7a2551ceded39d09d6b4f9 +Author: Kshitij +Date: Sat Apr 5 00:51:28 2025 +0530 + + Added write-up for assignment A6. + +commit 4e0476842b682cff9a83afc81d18bc4e5cee399d +Author: Kshitij +Date: Wed Apr 2 21:52:34 2025 +0530 + + Updated readme. + +commit a0778cf8eb0793344dd543f43c86c6b36718eeb5 +Author: Kshitij +Date: Wed Apr 2 21:49:36 2025 +0530 + + Added handout for assignment A9. + +commit 5fa8db3687d93134c84b958cd337f1899a1febe3 +Author: Kshitij +Date: Tue Apr 1 21:14:37 2025 +0530 + + Added write-up for assignment A5. + +commit a1c57770b5d6c383701971bc4306ee1ec6dc7d4a +Author: Kshitij +Date: Fri Mar 28 11:23:07 2025 +0530 + + Added notebook for assignment A5, added link in readme file for it and fixed order for codes section. + +commit c8b3f0028a4dc32d4b8ecc796e9ab55a7cc6a207 +Author: Kshitij +Date: Fri Mar 28 11:21:01 2025 +0530 + + Changed A9 notebook filename to title case. + +commit 8e00bb1a8aaf78b6075e24b1e5bcd1d00d173612 +Author: Kshitij +Date: Fri Mar 28 11:10:57 2025 +0530 + + Changed filenames to title case in code, updated those links in readme and added link for code a5. + +commit 2e4da857cdd1d61cbf84ef2c7d148434042c2152 +Author: Kshitij +Date: Fri Mar 28 11:07:20 2025 +0530 + + Added code for A5 (data analytics 2), i.e. logistic regression. + +commit d0039915597ac126aae14ec4be688820ab3d722e +Author: Kshitij +Date: Fri Mar 28 11:06:01 2025 +0530 + + Added social network ads dataset for assignment A5. + +commit 0f81db5e79d6576448d21ba4571d73c8a8c7f2bd +Author: Kshitij +Date: Thu Mar 27 23:52:58 2025 +0530 + + Added link for unit 4 notes folder. + +commit ecca87fd8244b415cd3020cd2f484830b5be9316 +Author: Kshitij +Date: Thu Mar 27 23:50:00 2025 +0530 + + Added unit 4 ppt. + +commit 9264ae09238a75aeee14f46ae7e21e45af2252e9 +Author: Kshitij +Date: Thu Mar 27 23:41:14 2025 +0530 + + Added notes for unit 3. + +commit 289d4a0b63d8f96151f7fc44d3342f1f079c5cd1 +Author: Kshitij +Date: Thu Mar 27 23:22:10 2025 +0530 + + Added write-up for assignment A7. + +commit cccd5ceea5c7f2283da808900acda60b101f337d +Author: Kshitij +Date: Thu Mar 27 23:13:05 2025 +0530 + + Added write-up for assignment A4. + +commit ed53ba5c13bba6d4804145a2f98b5a7dde66beef +Author: Kshitij +Date: Tue Mar 25 22:33:26 2025 +0530 + + Added March 2025 DSBDA insem question paper. + +commit 30dd0ae331a6b87aad799196002f7eec6183e363 +Author: Kshitij +Date: Mon Mar 3 22:50:03 2025 +0530 + + Added hypothesis testing examples, unit 2 given by ma'am. + +commit 9d98752a6ffeb883fb75e61d693a4f6e558a7e98 +Author: Kshitij +Date: Sat Mar 1 19:32:51 2025 +0530 + + Added unit 2 ppt given by ma'am. + +commit 15294f5e06aa6a8f8c43ae7a3bdd56f53ca77b60 +Author: Kshitij +Date: Wed Feb 19 00:00:24 2025 +0530 + + Added write-up for B1. + +commit 0ebd2b46f80250064f37cb062e4ed4613a082c5d +Author: Kshitij +Date: Fri Feb 14 11:42:54 2025 +0530 + + Added code for A10 (data visualization 3) + +commit 8a29d83a3c9c58891572d3c0237e552a51f87af7 +Author: Kshitij +Date: Wed Feb 12 23:11:15 2025 +0530 + + Added write-up for assignment-a10. Written by Kalaskar. + +commit ab11a51ecb1bf88b1c7bbca64cc9353b5d8100f2 +Author: Kshitij +Date: Wed Feb 12 23:10:32 2025 +0530 + + Added write-up for assignment-a3. Written by Mohit! + +commit 05ef5419e675366c84aea0931c54bda2a230be27 +Author: Kshitij +Date: Wed Jan 29 21:55:20 2025 +0530 + + Updated names of write-up A1 and A2 to match with write-up filenames in other repos. + +commit 1bba1ea0e7c7ced90bfaa61fff59932868cfb1c6 +Author: Kshitij +Date: Wed Jan 29 21:54:19 2025 +0530 + + Added write-up for assignment 8. + +commit 252051e5c1b631089e2158b5bd38f31827e575fa +Author: Kshitij +Date: Wed Jan 29 14:42:05 2025 +0530 + + Added ppt for unit 1. Provided by ma'am. + +commit 9f19d2d65487a7b914812556c8195a38df94731d +Author: Kshitij +Date: Fri Jan 24 11:29:21 2025 +0530 + + Fixed file extension for notebook a9. + +commit 64cc0a7bb55a13e8158e2ce7eae6c99d0e86d9d7 +Author: Kshitij +Date: Fri Jan 24 11:28:08 2025 +0530 + + Fixed link in readme. + +commit 0036698a722c5f820b513b0d82c602b822bae353 +Author: Kshitij +Date: Fri Jan 24 11:26:10 2025 +0530 + + Following Amerian english for title since that's what's in the syllabus file. Also, added links in readme. + +commit 79dd2b3ec888137a44d39cde9a36d57d97e0985e +Author: Kshitij +Date: Fri Jan 24 11:18:33 2025 +0530 + + Added code for a9. + +commit cdbc080f14c9d4009af91bd326870d229b1bea2c +Author: Kshitij +Date: Fri Jan 24 11:18:22 2025 +0530 + + Moved notebook a9 to notebooks folder and changed its name for better understanding. + +commit ae53d3ec6f7fcd3529d2684b1a45b391ebabc120 +Author: Kshitij +Date: Fri Jan 24 11:08:34 2025 +0530 + + Added jupyter notebook for A9. + +commit 8c47f6097ddec0247cf73bc3d9d713380d1b947e +Author: Kshitij +Date: Tue Jan 21 22:34:45 2025 +0530 + + Added write-up for assignment-2. + +commit 5627914da6ce12d42eacd4d40478acd2babd6f09 +Author: Kshitij +Date: Sat Jan 18 01:08:12 2025 +0530 + + Added links for assignment folders in readme. + +commit af0d3e7e6a1a638faac8de631a4e7ec219c3249b +Author: Kshitij +Date: Sat Jan 18 01:06:16 2025 +0530 + + Added handouts (A2-A8, A10, B1, B2, B4) + +commit 3f71e805c8f1b0d4504ddb37be77952eb92642c6 +Author: Kshitij +Date: Thu Jan 16 21:08:16 2025 +0530 + + Added write-up for assignment-a1. Thanks to Ayush Kalaskar for writing it! + +commit 63a36a2e95d61a35d9099ff5731bf4df0b4fd2ce +Author: Kshitij +Date: Thu Jan 16 21:04:35 2025 +0530 + + Added handout for assignment-a1. + +commit 6deb1a343af8f06f204b33076773655438cfd0b9 +Author: Kshitij +Date: Thu Jan 16 00:33:26 2025 +0530 + + Added IN-SEM and END-SEM question papers (2022, 2023, 2024). + +commit 32c39ea7803f519f984c0c785c2d5d87f51b379f +Author: Kshitij +Date: Wed Jan 8 16:00:12 2025 +0530 + + Added all the required sections in README file. + + - Contribution tip + - Index section + - Miscellaneous section + +commit 50d6d49ccc813c323dc50b33ab60c39ff13d29c5 +Author: Kshitij +Date: Wed Jan 8 15:57:53 2025 +0530 + + Added DISCLAIMER and motto file. + +commit bf2e909b410b55bc2a7d01cc0c16a86afcd4f5af +Author: Kshitij +Date: Wed Jan 8 15:57:08 2025 +0530 + + Initial commit.