Ultimate Developer Command Guide
Python, PySpark & SQL Reference
Essential Commands for Developers
Python Commands

print()
    Outputs data to the console.
    Example: print("Welcome to Python!")  # Prints: Welcome to Python!

len()
    Returns the length of an object.
    Example: my_list = [1, 2, 3]; print(len(my_list))  # Prints: 3

range()
    Generates a sequence of numbers.
    Example: for i in range(3): print(i)  # Prints: 0, 1, 2

def
    Defines a custom function.
    Example:
        def greet(name): return f"Hello, {name}"
        print(greet("Alice"))  # Prints: Hello, Alice

import
    Imports a module or library.
    Example: import math; print(math.pi)  # Prints: 3.141592653589793

[x for x in iterable]
    Creates a list using a comprehension.
    Example: squares = [x**2 for x in [1, 2, 3]]; print(squares)  # Prints: [1, 4, 9]

if/elif/else
    Conditional logic.
    Example:
        x = 10
        if x > 5: print("Big")
        else: print("Small")
        # Prints: Big

for
    Iterates over a sequence.
    Example: for fruit in ["apple", "banana"]: print(fruit)  # Prints: apple, banana

while
    Loops until its condition becomes false.
    Example:
        count = 0
        while count < 3: print(count); count += 1
        # Prints: 0, 1, 2

try/except
    Handles exceptions.
    Example:
        try: print(1 / 0)
        except ZeroDivisionError: print("Cannot divide by zero")
        # Prints: Cannot divide by zero

open()
    Opens a file for reading or writing.
    Example: with open("example.txt", "w") as f: f.write("Hello")  # Creates the file with text

list.append()
    Adds an item to the end of a list.
    Example: my_list = []; my_list.append(5); print(my_list)  # Prints: [5]

dict.get()
    Retrieves a value from a dictionary (returns None, or a supplied default, if the key is missing).
    Example: my_dict = {"key": "value"}; print(my_dict.get("key"))  # Prints: value
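Several of the commands above compose naturally. The sketch below is a small, self-contained illustration (the function name `describe` and the sample scores are invented for this example): it uses def, a list comprehension, dict.get() with a default, and try/except.

```python
# Combines commands from the table: def, list comprehension,
# dict.get() with a default, and try/except.

def describe(scores):
    """Return a one-line summary for a dict of name -> score."""
    # List comprehension: keep only passing scores
    passing = [s for s in scores.values() if s >= 50]
    # dict.get() with a default avoids a KeyError for missing keys
    alice = scores.get("alice", 0)
    try:
        average = sum(passing) / len(passing)
    except ZeroDivisionError:  # empty dict -> no passing scores
        average = 0.0
    return f"{len(passing)} passing, alice={alice}, avg={average:.1f}"

print(describe({"alice": 80, "bob": 40, "carol": 70}))  # Prints: 2 passing, alice=80, avg=75.0
```

Note how the try/except turns the empty-input edge case into a defined result instead of a crash.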
PySpark Commands

SparkSession.builder
    Initializes a Spark session.
    Example: from pyspark.sql import SparkSession; spark = SparkSession.builder.appName("MyApp").getOrCreate()

spark.read.csv()
    Loads a CSV file into a DataFrame.
    Example: df = spark.read.csv("data.csv", header=True, inferSchema=True); df.show()  # Displays CSV data

df.show()
    Displays the first n rows of a DataFrame.
    Example: df.show(3)  # Shows first 3 rows

df.printSchema()
    Displays the DataFrame schema.
    Example: df.printSchema()  # Shows column names and types

df.select()
    Selects specific columns.
    Example: df.select("name", "age").show()  # Shows name and age columns

df.filter()
    Filters rows based on a condition.
    Example: df.filter(df.age > 25).show()  # Shows rows where age > 25

df.where()
    Alias for filter(); also accepts a SQL expression string.
    Example: df.where("salary > 50000").show()  # Filters rows where salary > 50000

df.groupBy().agg()
    Groups data and applies an aggregation.
    Example: df.groupBy("department").agg({"salary": "avg"}).show()  # Shows avg salary per dept

df.join()
    Joins two DataFrames.
    Example: df1.join(df2, df1.id == df2.id, "inner").show()  # Inner join on id

df.withColumn()
    Adds or modifies a column.
    Example: df.withColumn("age_plus_10", df.age + 10).show()  # Adds column with age + 10

df.withColumnRenamed()
    Renames a column.
    Example: df.withColumnRenamed("old_name", "new_name").show()  # Renames column

df.drop()
    Drops specified columns.
    Example: df.drop("salary").show()  # Drops salary column

df.fillna()
    Replaces null values.
    Example: df.fillna({"age": 0}).show()  # Replaces null ages with 0

df.dropDuplicates()
    Removes duplicate rows.
    Example: df.dropDuplicates(["name"]).show()  # Keeps one row per name

df.write.csv()
    Saves a DataFrame as CSV (Spark writes a directory of part files at this path).
    Example: df.write.csv("output.csv", mode="overwrite")  # Saves DataFrame to CSV

df.createOrReplaceTempView()
    Registers a DataFrame as a temporary SQL view.
    Example: df.createOrReplaceTempView("temp_table")  # Creates SQL view

spark.sql()
    Runs a SQL query against registered views.
    Example: spark.sql("SELECT name FROM temp_table WHERE age > 30").show()  # Runs SQL query

Window.partitionBy()
    Defines a window for ranking/aggregation.
    Example:
        from pyspark.sql.window import Window
        from pyspark.sql.functions import row_number
        w = Window.partitionBy("dept").orderBy("salary")
        df.withColumn("rank", row_number().over(w)).show()  # Adds rank column
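Chained together, the DataFrame calls above form a typical small pipeline. The sketch below assumes PySpark is installed and a local session is acceptable; the app name, column names (dept, salary), and sample rows are illustrative, and createDataFrame() stands in for spark.read.csv() so the snippet needs no input file.

```python
from pyspark.sql import SparkSession
from pyspark.sql.functions import avg, row_number
from pyspark.sql.window import Window

# Local session; requires a working PySpark installation
spark = SparkSession.builder.appName("CheatSheetDemo").master("local[1]").getOrCreate()

# Small in-memory DataFrame in place of spark.read.csv()
df = spark.createDataFrame(
    [("Alice", "Eng", 60000), ("Bob", "Eng", 50000), ("Cara", "HR", 45000)],
    ["name", "dept", "salary"],
)

# filter -> groupBy/agg, as in the table entries above
df.filter(df.salary > 40000).groupBy("dept").agg(avg("salary").alias("avg_salary")).show()

# Window ranking: highest salary first within each dept
w = Window.partitionBy("dept").orderBy(df.salary.desc())
df.withColumn("rank", row_number().over(w)).show()

spark.stop()
```

The window step is where PySpark departs most from plain SQL habits: the window spec (w) is built once and reused inside over().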
SQL Commands

SELECT
    Retrieves data from a table.
    Example: SELECT name, age FROM employees  -- Selects name and age columns

WHERE
    Filters rows based on a condition.
    Example: SELECT * FROM employees WHERE age > 30  -- Filters employees older than 30

ORDER BY
    Sorts the result set.
    Example: SELECT * FROM employees ORDER BY salary DESC  -- Sorts by salary in descending order

GROUP BY
    Groups rows for aggregation.
    Example: SELECT department, AVG(salary) FROM employees GROUP BY department  -- Avg salary per dept

HAVING
    Filters grouped results.
    Example: SELECT department, COUNT(*) FROM employees GROUP BY department HAVING COUNT(*) > 5  -- Depts with > 5 employees

JOIN
    Combines rows from multiple tables.
    Example: SELECT e.name, d.dept_name FROM employees e JOIN departments d ON e.dept_id = d.id  -- Joins tables

LEFT JOIN
    Includes all rows from the left table, with NULLs where the right table has no match.
    Example: SELECT e.name, d.dept_name FROM employees e LEFT JOIN departments d ON e.dept_id = d.id  -- Left join

LIMIT
    Restricts the number of returned rows.
    Example: SELECT * FROM employees LIMIT 5  -- Returns at most 5 rows (order is undefined without ORDER BY)

INSERT INTO
    Adds new rows to a table.
    Example: INSERT INTO employees (name, age) VALUES ('Alice', 28)  -- Inserts a new employee

UPDATE
    Modifies existing rows.
    Example: UPDATE employees SET salary = 60000 WHERE name = 'Alice'  -- Updates salary

DELETE
    Removes rows from a table.
    Example: DELETE FROM employees WHERE age < 18  -- Deletes rows where age < 18

CREATE TABLE
    Creates a new table.
    Example: CREATE TABLE employees (id INT, name VARCHAR(50), age INT)  -- Creates employees table

ALTER TABLE
    Modifies a table's structure.
    Example: ALTER TABLE employees ADD COLUMN salary DECIMAL(10,2)  -- Adds salary column

DROP TABLE
    Deletes a table.
    Example: DROP TABLE employees  -- Deletes employees table
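The statements above can be tried without a database server: Python's built-in sqlite3 module runs them against an in-memory database. The sketch below reuses the employees table from the examples; the sample rows are invented, and SQLite's type names (INTEGER, TEXT, REAL) differ slightly from the INT/VARCHAR/DECIMAL shown above.

```python
import sqlite3

# In-memory database: nothing is written to disk
conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# CREATE TABLE and ALTER TABLE, as in the examples above
cur.execute("CREATE TABLE employees (id INTEGER, name TEXT, age INTEGER)")
cur.execute("ALTER TABLE employees ADD COLUMN salary REAL")

# INSERT INTO, using ? placeholders instead of inlined literals
rows = [(1, "Alice", 28, 60000.0), (2, "Bob", 35, 50000.0), (3, "Cara", 17, 0.0)]
cur.executemany("INSERT INTO employees VALUES (?, ?, ?, ?)", rows)

# UPDATE and DELETE
cur.execute("UPDATE employees SET salary = 65000 WHERE name = 'Alice'")
cur.execute("DELETE FROM employees WHERE age < 18")

# SELECT combining WHERE, ORDER BY, and LIMIT
cur.execute("SELECT name FROM employees WHERE age > 25 ORDER BY salary DESC LIMIT 5")
names = [r[0] for r in cur.fetchall()]
print(names)  # Prints: ['Alice', 'Bob']
```

Placeholders (?) are worth the habit even in throwaway scripts: they sidestep quoting bugs and SQL injection alike.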
Cheat Sheet Summary
Comprehensive reference for Python, PySpark, and SQL development tasks.
Version 2.0 | Updated: August 2024
Print Tip: Use Ctrl+P (Win) / Cmd+P (Mac) to save as PDF