
Ultimate Developer Command Guide

Python, PySpark & SQL Reference


Essential Commands for Developers

Python Commands

print(): Outputs data to the console.
  Example: print("Welcome to Python!")  # Prints: Welcome to Python!

len(): Returns the length of an object.
  Example: my_list = [1, 2, 3]; print(len(my_list))  # Output: 3

range(): Generates a sequence of numbers.
  Example: for i in range(3): print(i)  # Prints: 0, 1, 2

def: Defines a custom function.
  Example:
    def greet(name): return f"Hello, {name}"
    print(greet("Alice"))  # Prints: Hello, Alice

import: Imports a module or library.
  Example: import math; print(math.pi)  # Prints: 3.141592653589793

[x for x in iterable]: Creates a list using a comprehension.
  Example: squares = [x**2 for x in [1, 2, 3]]; print(squares)  # Prints: [1, 4, 9]

if/elif/else: Conditional logic.
  Example:
    x = 10
    if x > 5:
        print("Big")
    else:
        print("Small")
    # Prints: Big

for: Iterates over a sequence.
  Example: for fruit in ["apple", "banana"]: print(fruit)  # Prints: apple, banana

while: Loops until a condition is false.
  Example:
    count = 0
    while count < 3: print(count); count += 1
    # Prints: 0, 1, 2

try/except: Handles exceptions.
  Example:
    try:
        print(1 / 0)
    except ZeroDivisionError:
        print("Cannot divide by zero")
    # Prints: Cannot divide by zero

open(): Opens a file for reading or writing.
  Example: with open("example.txt", "w") as f: f.write("Hello")  # Creates the file with text

list.append(): Adds an item to a list.
  Example: my_list = []; my_list.append(5); print(my_list)  # Prints: [5]

dict.get(): Retrieves a value from a dictionary.
  Example: my_dict = {"key": "value"}; print(my_dict.get("key"))  # Prints: value
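
Putting several of these together: the short script below is a minimal sketch that combines def, a list comprehension, dict.get(), try/except, and open(). The variable names and the example.txt file are illustrative, not part of the guide.

    def greet(name):
        return f"Hello, {name}"

    names = ["Alice", "Bob"]
    greetings = [greet(n) for n in names]      # list comprehension
    for g in greetings:
        print(g)                               # Prints: Hello, Alice / Hello, Bob

    lengths = {n: len(n) for n in names}       # map each name to its length
    print(lengths.get("Alice"))                # Prints: 5

    try:
        print(1 / 0)                           # deliberate error
    except ZeroDivisionError:
        print("Cannot divide by zero")

    with open("example.txt", "w") as f:        # write a small text file
        f.write("Hello")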

PySpark Commands

SparkSession.builder: Initializes a Spark session.
  Example: from pyspark.sql import SparkSession; spark = SparkSession.builder.appName("MyApp").getOrCreate()

spark.read.csv(): Loads a CSV file into a DataFrame.
  Example: df = spark.read.csv("data.csv", header=True, inferSchema=True); df.show()  # Displays CSV data

df.show(): Displays the first n rows of a DataFrame.
  Example: df.show(3)  # Shows first 3 rows

df.printSchema(): Displays the DataFrame schema.
  Example: df.printSchema()  # Shows column names and types

df.select(): Selects specific columns.
  Example: df.select("name", "age").show()  # Shows name and age columns

df.filter(): Filters rows based on a condition.
  Example: df.filter(df.age > 25).show()  # Shows rows where age > 25

df.where(): Alias for filter.
  Example: df.where("salary > 50000").show()  # Filters rows where salary > 50000

df.groupBy().agg(): Groups data and applies an aggregation.
  Example: df.groupBy("department").agg({"salary": "avg"}).show()  # Shows avg salary per dept

df.join(): Joins two DataFrames.
  Example: df1.join(df2, df1.id == df2.id, "inner").show()  # Inner join on id

df.withColumn(): Adds or modifies a column.
  Example: df.withColumn("age_plus_10", df.age + 10).show()  # Adds column with age + 10

df.withColumnRenamed(): Renames a column.
  Example: df.withColumnRenamed("old_name", "new_name").show()  # Renames column

df.drop(): Drops specified columns.
  Example: df.drop("salary").show()  # Drops salary column

df.fillna(): Replaces null values.
  Example: df.fillna({"age": 0}).show()  # Replaces null ages with 0

df.dropDuplicates(): Removes duplicate rows.
  Example: df.dropDuplicates(["name"]).show()  # Drops duplicate names

df.write.csv(): Saves a DataFrame as CSV.
  Example: df.write.csv("output.csv", mode="overwrite")  # Saves DataFrame to CSV

df.createOrReplaceTempView(): Registers a DataFrame as a SQL table.
  Example: df.createOrReplaceTempView("temp_table")  # Creates SQL view

spark.sql(): Runs a SQL query on a DataFrame.
  Example: spark.sql("SELECT name FROM temp_table WHERE age > 30").show()  # Runs SQL query

Window.partitionBy(): Defines a window for ranking/aggregation.
  Example:
    from pyspark.sql.window import Window
    from pyspark.sql.functions import row_number
    w = Window.partitionBy("dept").orderBy("salary")
    df.withColumn("rank", row_number().over(w)).show()  # Adds rank column
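
As a quick illustration of how these DataFrame operations chain together, here is a minimal sketch of a small pipeline. The input file employees.csv, its columns (name, age, department, salary), and the output path are assumptions for the example, not part of the guide.

    from pyspark.sql import SparkSession
    from pyspark.sql import functions as F

    spark = SparkSession.builder.appName("MyApp").getOrCreate()

    # Assumed input: employees.csv with name, age, department, salary columns
    df = spark.read.csv("employees.csv", header=True, inferSchema=True)

    result = (
        df.filter(df.age > 25)                        # keep employees older than 25
          .withColumn("age_plus_10", df.age + 10)     # add a derived column
          .groupBy("department")
          .agg(F.avg("salary").alias("avg_salary"))   # average salary per department
    )
    result.show()

    result.write.csv("avg_salary_by_dept", mode="overwrite")  # save the aggregate
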
SQL Commands

SELECT: Retrieves data from a table.
  Example: SELECT name, age FROM employees  -- Selects name and age columns

WHERE: Filters rows based on a condition.
  Example: SELECT * FROM employees WHERE age > 30  -- Filters employees older than 30

ORDER BY: Sorts the result set.
  Example: SELECT * FROM employees ORDER BY salary DESC  -- Sorts by salary in descending order

GROUP BY: Groups rows for aggregation.
  Example: SELECT department, AVG(salary) FROM employees GROUP BY department  -- Avg salary per dept

HAVING: Filters grouped results.
  Example: SELECT department, COUNT(*) FROM employees GROUP BY department HAVING COUNT(*) > 5  -- Depts with > 5 employees

JOIN: Combines rows from multiple tables.
  Example: SELECT e.name, d.dept_name FROM employees e JOIN departments d ON e.dept_id = d.id  -- Joins tables

LEFT JOIN: Includes all rows from the left table.
  Example: SELECT e.name, d.dept_name FROM employees e LEFT JOIN departments d ON e.dept_id = d.id  -- Left join

LIMIT: Restricts the number of returned rows.
  Example: SELECT * FROM employees LIMIT 5  -- Returns first 5 rows

INSERT INTO: Adds new rows to a table.
  Example: INSERT INTO employees (name, age) VALUES ('Alice', 28)  -- Inserts a new employee

UPDATE: Modifies existing rows.
  Example: UPDATE employees SET salary = 60000 WHERE name = 'Alice'  -- Updates salary

DELETE: Removes rows from a table.
  Example: DELETE FROM employees WHERE age < 18  -- Deletes rows where age < 18

CREATE TABLE: Creates a new table.
  Example: CREATE TABLE employees (id INT, name VARCHAR(50), age INT)  -- Creates employees table

ALTER TABLE: Modifies table structure.
  Example: ALTER TABLE employees ADD COLUMN salary DECIMAL(10,2)  -- Adds salary column

DROP TABLE: Deletes a table.
  Example: DROP TABLE employees  -- Deletes employees table
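
These SQL statements can also be exercised directly from Python. The sketch below uses Python's built-in sqlite3 module (an assumption for illustration; the guide does not name a specific database) to run the CREATE TABLE, ALTER TABLE, INSERT INTO, UPDATE, DELETE, and SELECT examples end to end.

    import sqlite3

    # In-memory database, purely for illustration
    conn = sqlite3.connect(":memory:")
    cur = conn.cursor()

    cur.execute("CREATE TABLE employees (id INT, name VARCHAR(50), age INT)")
    cur.execute("ALTER TABLE employees ADD COLUMN salary DECIMAL(10,2)")
    cur.execute("INSERT INTO employees (id, name, age, salary) VALUES (1, 'Alice', 28, 55000)")
    cur.execute("INSERT INTO employees (id, name, age, salary) VALUES (2, 'Bob', 17, 0)")
    cur.execute("UPDATE employees SET salary = 60000 WHERE name = 'Alice'")
    cur.execute("DELETE FROM employees WHERE age < 18")

    cur.execute("SELECT name, age, salary FROM employees ORDER BY salary DESC LIMIT 5")
    print(cur.fetchall())  # [('Alice', 28, 60000)]

    conn.close()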

Cheat Sheet Summary


Comprehensive reference for Python, PySpark, and SQL development tasks.
Version 2.0 | Updated: August 2024

Print Tip: Use Ctrl+P (Win) / Cmd+P (Mac) to save as PDF
