About me

Hello! I'm Pavan Reddy Mogili, an impassioned technophile specializing in Data Science. I am currently enhancing my proficiency through a Master's degree in Data Science at Northeastern University, with my graduation lined up for May 2023.

My journey in the field started with roles at Cognizant Technology Solutions and a Data Scientist Internship at Affinia Therapeutics. At Northeastern University, I furthered my art of teaching others as a Teaching Assistant for Data Science.

I am passionate about delving into projects that challenge my knowledge of Machine Learning, Data Engineering, and Algorithms. My aim is to design data solutions that make intricate operations accessible and user-friendly, aiding businesses to harness data for informed decisions.

I can be reached at [email protected] or +16174072912. I'm currently looking for thrilling job opportunities that allow me to expand my skills and contribute to path-breaking projects. If my skills resonate with your requirements, let's get in touch!

What i'm doing

  • design icon

    Machine Learning

    During my tenure at Affinia and Northeastern University, I specialized in natural language processing models and honed my ability to teach complex machine learning concepts. I successfully achieved a 90% accuracy rate in entity matching and relationship extraction, proving my mettle in various data science projects using a multitude of machine learning and statistical methods.

  • Data Engineering

    With a proven track record in data engineering at Affinia, Northeastern University, and Cognizant Technology Solutions, I've developed robust natural language processing models, automated data capture, and storage solutions. I've also optimized data storage and retrieval using both SQL and NoSQL databases. My commitment to data quality, efficiency, and the transformation of raw data into insightful information is evident in my portfolio.

  • Data Analysis

    Across my roles at Affinia, Northeastern University, and Cognizant Technology Solutions, I've fine-tuned my skills in data analysis. I've pioneered the development of automated data capture, storage, and visualization platforms, in addition to optimizing SQL queries and database indexing. My talent lies in transforming raw data into actionable insights, a quality that has consistently aided in making informed, data-driven decisions.

Resume

Education

  1. Northeastern University

    2021 — 2023

    Master of Science in Data Science

    Relevant Coursework: Supervised Machine Learning, Unsupervised Machine Learning, Natural Language Processing, Algorithms, Deep Learning, Information Retrieval, Data Mining, Data Engineering, Data Science Capstone

    Activities and societies: The DATA club, The DAEC, Husky Competitive Programming Club

  2. Jawaharlal Nehru Technological University

    2015 — 2019

    Bachelor of Technology in Electronics and Communication Engineering

    Relevant Coursework: Data Structures and Algorithms, Python, Database Management System, Neural Networks, Operating Systems, Software Engineering, Computer Architecture

    Activities and societies: The Robotics Club, The Data Science Club, NSS, Sreenidhi Computer Science Club

    Vice President of The Electronix Club

Experience

  1. Affinia Therapeautics - Data Science Intern

    June 2022 — December 2022

    • Devised a natural language processing model using Python, accomplishing a notable 90% success rate in precise entity matching and relationship extraction tasks.

    • Innovated data extraction and cleansing techniques by implementing a web crawler using Python and Scrapy, enhancing data acquisition speed by an impressive 40%.

    • Collaborated in the design and creation of an automated machine learning pipeline encompassing data preprocessing, model training, and prediction processes.

    • Executed comprehensive data analysis utilizing SQL and Python libraries such as NumPy, Pandas, and Matplotlib, delivering crucial insights and recommendations to stakeholders.

  2. Northeastern University - Teaching Assistant (Data Science)

    January 2022 — June 2022

    • Assisted a class of 40 students in Python programming for a Machine Learning course, helping them understand supervised and unsupervised machine learning concepts.

  3. Cognizant Technology Solutions - Full Stack Developer & Data Scientist

    January 2019 — July 2021

    • Optimized data transformation processes and prototyped machine learning models employing Python, Docker, and AWS technologies, generating significant cost savings of $100,000 for the organization.

    • Engineered an advanced data examination solution using Hadoop and SQL, skyrocketing quarterly report generation efficiency by an astonishing 70%.

    • Spearheaded a team of five engineers to construct and maintain scalable data pipelines using Apache Kafka, amplifying data accessibility and integrity across diverse projects.

    • Instituted a robust strategy for collecting, cleaning, and managing data from various sources, curtailing data inconsistency by 50%.

    • Fostered a strong collaborative relationship with the data science team to translate their requirements into solid data engineering solutions, ensuring the seamless functioning of their machine learning applications.

    • Assisted in the development of a data-centric e-commerce platform handling metadata for over 10,000 products, leveraging data engineering techniques to ensure smooth data flow.

    • Applied machine learning algorithms to improve the personalization of user experiences and implemented efficient data pipeline solutions.

    • Designed and developed a data visualization platform using Tableau and PowerBI, enabling stakeholders to make data-driven decisions.

My skills

  • Languages: Python, R, C, C++, Java, JavaScript, HTML, SQL
    SQL ans NoSQL Databases: MySQL, MongoDB, PostgreSQL, Cassandra, Neo4j
    Libraries and Frameworks: NumPy, Pandas, Scikit-learn, Hadoop, PySpark, TensorFlow, Keras, Matplotlib, dplyr, ggplot2, NLTK, openCV2, XGBoost, SpaCy, Django, Seaborn, Spring Boot, Flask, Angular, React, PySpark
    Technologies and Platforms: AWS, Computer vision, NLP, CNN, Git, Docker, Apache Cassandra, Elasticsearch, Apache Kafka, Tableau, PowerBI, Alteryx, Excel

Portfolio