Somanath Kshirsagar

- Data Scientist -
I'm

About Me
Shrinivas Khiste

Somanath Kshirsagar

Post-Graduated Student @ DS Department, Fergusson College, Pune.

πŸ‘‹ Hey there! I'm Somnath Kshirsagar! 🌟 I feel incredibly grateful to have completed my Masters in Data Science from Fergusson College πŸŽ“. Being a part of Fergusson College has been an amazing journey for me, and I cherish the memories I've made there. πŸŽ‰ I did my bachelor's degree in Computer Science from Savitribai Phule Pune University πŸŽ“.

I find pure joy in creating end-to-end Data Science projects that solve real-world problems πŸš€.During my academic journey, I had the wonderful opportunity to intern at ERP Launchpad Company, where I delved deep into the realm of Data Analytics for six months πŸ’Ό. Additionally, I embarked on an exciting three-month internship in the NLP Domain at iNeuron.ai, where I worked on an intriguing bank chatbot project πŸ’¬.

My skill set includes a wide range of tools and technologies πŸ› οΈ. I'm well-versed in Machine Learning πŸ€–, Deep Learning 🧠, Natural Language Processing πŸ“š, Computer Vision πŸ‘οΈ, Python programming 🐍, PowerBI πŸ“Š, SQL πŸ—„οΈ, Git & GitHub πŸ™, Docker 🐳, CI/CD pipelines πŸ”„, and AWS Sagemaker ☁️, among others πŸ’». These are the building blocks that empower me to turn data into valuable insights and actionable solutions. πŸ“ˆ I'm incredibly passionate about leveraging my knowledge and skills in Data Science to make a meaningful impact in the world. ✨ Connecting with fellow enthusiasts and professionals in this field is something I truly look forward to, as we embark on this exciting journey of discovery and innovation. 🌐 Thank you for taking the time to get to know me a little better! πŸ™ Let's create data-driven magic together! ✨

Get to know more about my :

Skills Projects Work Experience Resume

Skills

Below are some of my skills, and I'm always excited to learn more.

Python

Machine Learning

Deep Learning

NLP

OpenCV

Hugging Face

C/C++

Tenser Flow

AWS

AWS Lambda

AWS SageMaker

CICD Pipeline

Docker

Git

Github

Google Data Studio

numpy

Pandas

Matplotlib

Seaborn

Power BI

MySql

StreamLit

PyTorch

Transformer Learning

Excel

Rest & Fast APIs

PyCharm

VSCode

Sci-Kit-Learn

My Education

* An investment in knowledge pays the best interest.

Fergusson College, Pune.

Master's Degree in Data Science.
Jun 2021 - July 2023.

I just completed my data science master's degree at Fergusson College, and I feel incredibly grateful to have been a part of this prestigious institution. πŸŽ“ This year, 2023, signifies the end of my postgraduate studies at Fergusson College. ✨ During my time there, I actively engaged in both academic pursuits πŸ“š and extracurricular activities πŸŽ‰.

Now, I am eagerly looking forward to the next chapter of my life, confident that my experiences and skills gained at Fergusson College have prepared me well for success. πŸ’ͺ🌟

T.C. College, Baramati.

Bachelor's Degree in Computer Science (BCS).
Jun 2018 - May 2021.

πŸŽ“ I take great pride in my time as a student at TC College in Baramati, where I pursued my UG degree in B.Sc. Computer Science. βœ… The year 2021 marked the successful completion of my undergraduate journey.

Throughout my academic tenure, I actively engaged in a diverse range of extra-curricular activities, augmenting my overall growth. 🌟 I am immensely grateful for the opportunity to be a part of such a highly esteemed institution, which has shaped me into who I am today. πŸ™πŸΌπŸ«

Projects

Here are some of the projects that I have worked on

  • All
  • Computer Vision
  • Machine Learning
  • NLP

Health Adviser

Random Forest | ResNet | Logistic Regression | Streamlit | CI/CD Pipeline | Docker | Render.
  • I'm excited to share the successful completion of the Health Advisor project, where I integrated cutting-edge analytics techniques like Random Forest, ResNet, and Logistic Regression.
  • By leveraging Streamlit, I developed an intuitive interface, and utilizing GitHub, I established an advanced data-driven solution for disease diagnosis, early detection, and pneumonia assessment.
  • To optimize efficiency and deployment, I implemented a CI/CD pipeline, Dockerized the project, and deployed it on the Render cloud platform. This project has the potential to revolutionize healthcare outcomes, bridging the gap between technology and well-being. πŸš€πŸ’‘πŸ“ŠπŸ’»

least Square

Machine Learning | Regression | Linear | Polynomial | Lasso | Ridge | Streamlit | CI/CD Pipeline | Docker | Render.
  • πŸš—πŸ’° The LeastSquare project πŸ“ŠπŸ”¬ developed a Streamlit WebApp using regression models to predict used car prices. By scraping data from Cars24.com πŸŒπŸš€, performing data cleaning and feature engineering, the project achieved accurate predictions.
  • Utilizing advanced regression techniques, hyperparameter tuning, CI/CD Piepline and Dockerization 🐳, the project was deployed on the Render cloud platform β˜οΈπŸš€.
  • The user-friendly interface πŸŒŸπŸ“² allowed users to interact with the model and make informed decisions in the used car market. Let's drive towards better predictions! πŸ’ͺπŸ”πŸ’‘

Bag-Of-Vectors

Machine Learning | Spacy | Random Forest | Logistic Regression | Streamlit | NLP | CI/CD Pipeline | Docker | Render.
  • πŸ” In this project, we aimed to determine the authenticity of job descriptions by utilizing a bag-of-vectors approach. πŸ“Š Exploratory data analysis was conducted to gain insights into the dataset.
  • πŸ“ Text preprocessing was performed using Spacy, followed by the conversion of text into count and TF-IDF vectors. πŸŒ²πŸ”€ Random Forest and Logistic Regression models were fitted to the data for classification purposes.
  • The project leveraged πŸ€– machine learning techniques, πŸ“š NLP, and tools such as Spacy and Streamlit to develop an effective job authenticity prediction system.

Health Insurance Cost Prediction

Python | machine learning | Random Forest Regressor | GradientBoostingRegressor | CI/CD Pipeline | Docker | Render.
  • πŸš€ We developed a health insurance cost prediction project using Python and machine learning techniques. πŸ“ŠπŸ§ͺ By analyzing and preprocessing the data, we trained Random Forest 🌳 and GradientBoostingRegressor πŸ“ˆ models.
  • The GradientBoostingRegressor model achieved an impressive 95% accuracy βœ”οΈβ­οΈ in predicting health insurance costs. πŸ’°πŸ’‰ This project demonstrates the power of machine learning algorithms in accurately estimating healthcare expenses.
  • πŸ’ͺ The code and further details can be found on GitHub. πŸ™ Good luck with your health insurance cost predictions, and feel free to ask any questions! β“πŸ’Ό

Text Summarization using AWS SageMaker

Python | NLP | Machine Learning | AWS Sagemaker | S3 Bucket | EC2 Instance | CI/CD Pipeline | Docker.
  • The End-to-end Text Summarizer project aims to develop a comprehensive solution for generating concise summaries from text data.
  • The project leverages AWS services and follows a structured workflow to ensure efficient summarization of information.

Harvestify 🌿

Python | Machine Learning | Deep Learning | Computer Vision | RESNET | PyTorch | CI/CD Pipeline | Docker | Render.
  • A simple ML and DL based website which recommends the best crop to grow, fertilizers to use and the diseases caught by your crops.
  • In the crop recommendation application, the user can provide the soil data from their side and the application will predict which crop should the user grow.
  • For the fertilizer recommendation application, the user can input the soil data and the type of crop they are growing, and the application will predict what the soil lacks or has excess of and will recommend improvements.
  • For the last application, that is the plant disease prediction application, the user can input an image of a diseased plant leaf, and the application will predict what disease it is and will also give a little background about the disease and suggestions to cure it.

Experience

A short summary of my Work Experience...

  • ERP launchpad
    Jan 2023 - June 2023

    ERP Launchpad,
    Pune, India

    Proficient in Excel for data analysis, reporting, and presentation. Skilled in advanced functions like VLOOKUP, IF, SUMIF, and COUNTIF. Utilized Excel for financial analysis, forecasting, and budgeting. Experienced in creating visually appealing reports and dashboards using Google Data Studio, connecting multiple data sources like Google Analytics and Google Sheets. Able to create custom metrics, dimensions, and filters for customized reports and dashboards.

  • iNeuron.ai
    Sept 2022 - Nov 2022

    iNeuron Intern,
    Banglore, India

    During my three-month internship at iNeuron.ai, I had the opportunity to work on an exciting Bank Chatbot project called "Shera" πŸ€–πŸ’¬. The chatbot was designed to provide accurate responses to customer queries about the bank's products and services, including account opening, loans, and digital offerings πŸ’ΌπŸ’°πŸŒ. Utilizing a custom dataset, we trained the chatbot using deep learning techniques integrated with the PyTorch framework 🧠πŸ”₯. Employing #NLP methods such as stemming, tokenization, and lemmatization, Shera effectively understood and responded to a wide range of customer questions πŸ“šπŸ’‘.
    #DL #NLP #AI #Chatbot #BankingTechnology

Contact Me

Email: somanathtk198@gmail.com
Phone: 9518710423

...or use the following form