Kareemuddin M.
Skills
- Languages: Python (pandas, NumPy), SQL, R, Linux Bash - Frameworks & Libraries: Scikit-learn, Statsmodels, Matplotlib, Seaborn, Plotly, Kafka, PySpark, Apache Spark - Cloud & Data Engineering: Databricks, AWS (S3, Glue, Redshift), Azure Data Factory, Azure Synapse, Google BigQuery - Database: SQL Server, PostgreSQL, MySQL, Redshift, Snowflake - ETL & Data Pipelines: dbt (Data Build Tool), Airflow, Fivetran, Stitch, Python-based ETL workflows, REST APIs - Visualization Tools: Power BI (DAX, Power Query), Tableau, Looker, Excel (Advanced Excel, Power Query, Macros, VBA) - Tools: Git, Jupyter Notebook, VS Code, Excel (Pivot, VBA), REST APIs - Data Warehousing: Python-based ETL workflows, Airflow, AWS S3 & Glue, SQL-based data integration, Data Modeling - Machine Learning & Analytics: Predictive Modeling, A/B Testing, Regression, Classification, Anomaly Detection, Statistical Analysis - SQL Expertise: Query Optimization, Stored Procedures, Performance Tuning, Data Joins, Window Functions - Project & Collaboration: Agile/Scrum, Jira, Confluence, Documentation of Pipelines & Workflows, KPI Dashboards
About
Data Analyst with 5+ years of experience in data integration, ETL, analytics, and dashboard development across IT and financial services.
Proficient in Python, SQL, Power BI, and cloud-based analytics using AWS, Azure, and Databricks. Skilled in data modeling, machine learning,
and automated KPI reporting. Experienced in building modern data pipelines using tools like dbt, Fivetran, and Snowflake to translate raw data
into actionable insights that drive operational efficiency and business decision-making.