Samples Resume AWS
An AWS-certified Sr. Data Scientist with over 10 years of professional experience in Machine
Learning, Artificial Intelligence, Python, PySpark, Statistical Modeling, Deep Learning, Data
Analytics, Data Modeling, Data Architecture, Data Analysis, Data Mining, Text Mining & Natural
Language Processing (NLP), and Business Intelligence.
Experienced in building analytics models such as Decision Trees and Linear & Logistic Regression,
using Hadoop (Hive, Pig), R, Python, Spark, Scala, MS Excel, SQL, PostgreSQL, and Erwin.
Comfortable with R, Python, SAS, Weka, MATLAB, and relational databases. Deep
understanding of and exposure to the Big Data ecosystem.
Expertise in Data Analysis, Data Migration, Data Profiling, Data Cleansing, Transformation,
Integration, Data Import, and Data Export through the use of multiple ETL tools such as
Informatica PowerCenter.
Good knowledge of and experience with deep learning algorithms such as Artificial Neural Networks
(ANN), Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNN), and LSTM,
including RNN-based speech recognition using TensorFlow.
Good knowledge of Natural Language Processing (NLP) and of time series analysis and
forecasting using ARIMA models in Python and R.
Hands-on experience with the Hadoop ecosystem and the Spark framework, including HDFS,
MapReduce, HiveQL, SparkSQL, and PySpark.
Experience in building visual data processing pipelines, including the design of the necessary
validation automation. Well versed in problem solving, debugging, and troubleshooting.
Technical Skills:
Languages: Python (2.x/3.x), R, SAS, SQL, T-SQL
AWS Cloud Services: Amazon EC2, Amazon S3, Amazon SimpleDB, Amazon MQ, Amazon ECS, AWS Lambda, Amazon SageMaker, Amazon RDS, Elastic Load Balancing, Elasticsearch, Amazon SQS, AWS Identity and Access Management (IAM), Amazon CloudWatch, Amazon EBS, AWS CloudFormation
Databases: MySQL, PostgreSQL, Oracle, HBase, Amazon Redshift, MS SQL Server 2016/2014/2012/2008 R2/2008, Teradata
Statistical Methods: Hypothesis Testing, ANOVA, Time Series, Confidence Intervals, Bayes' Law, Principal Component Analysis (PCA), Dimensionality Reduction, Cross-Validation, Autocorrelation
AI/ML: Regression Analysis, Bayesian Methods, Decision Trees, Random Forests, Support Vector Machines, Neural Networks, Sentiment Analysis, K-Means Clustering, KNN, Ensemble Methods
Hadoop Ecosystem: Hadoop 2.x, Spark 2.x, MapReduce, Hive, HDFS, Sqoop, Flume
Reporting Tools: Tableau suite of tools 10.x, 9.x, 8.x (Desktop, Server, and Online), SQL Server Reporting Services (SSRS)
Data Analytics Tools: Python (NumPy, SciPy, pandas, Gensim, Keras), R (Caret, Weka, ggplot)
Data Visualization: Tableau, visualization packages, Matplotlib, Seaborn, ggplot2, Microsoft Office
Operating Systems: PowerShell, UNIX/UNIX Shell Scripting, Linux, and Windows
Certifications:
AWS Certified Solutions Architect - Associate.
Google Certified TensorFlow Developer.
Professional Experience: