Microsoft Scope

Jingren Zhou
Microsoft Corp.
Large-scale Distributed Computing

... ...
How to program the beast?
Large data centers (x1000 machines): storage and computation Key technology for search (Bing, Google, Yahoo) Web data analysis, user log analysis, relevance studies, etc.
Map-Reduce / GFS
GFS / Bigtable provide distributed storage
The Map-Reduce programming model Good abstraction of group-by-aggregation operations
Map function -> grouping Reduce function -> aggregation
Very rigid: every computation has to be structured as a
sequence of map-reduce pairs Not completely transparent: users still have to use a parallel mindset Error-prone and suboptimal: writing map-reduce programs is equivalent to writing physical execution plans in DBMS
Pig Latin / Hadoop

Hadoop: distributed file system and map-reduce
execution engine Pig Latin: a dataflow language using a nested data model
Imperative programming style Relational data manipulation primitives and plug-in
code to customize processing New syntax need to learn a new language Queries are mapped to map-reduce engine
SCOPE / Cosmos
Cosmos Storage System Append-only distributed file system for storing petabytes of data Optimized for sequential I/O Data is compressed and replicated Cosmos Execution Environment Flexible model: a job is a DAG (directed acyclic graph)

Vertices -> processes, edges -> data flows The job manager schedules and coordinates vertex execution
Provides runtime optimization, fault
tolerance, resource management
SCOPE
tructured omputations
ptimized for arallel xecution A declarative scripting language

Easy to use: SQL-like syntax plus MapRecuce-like extensions Modular: provides a rich class of runtime operators Highly extensible: Fully integrated with .NET framework Provides interfaces for customized operations Flexible programming style: nested expressions or a series of
simple transformations
Users focus on problem solving as if on a single machine System complexity and parallelism are hidden
An Example: QCount
Compute the popular queries that have been requested at least 1000 times
SELECT query, COUNT(*) AS count FROM search.log USING LogExtractor GROUP BY query HAVING count> 1000 ORDER BY count DESC; OUTPUT TO qcount.result e = EXTRACT query FROM search.log USING LogExtractor; s1 = SELECT query, COUNT(*) AS count FROM e GROUP BY query; s2 = SELECT query, count FROM s1 WHERE count> 1000; s3 = SELECT query, count FROM s2 ORDER BY count DESC; OUTPUT s3 TO qcount.result
Data model: a relational rowset with well-defined schema
Input and Output

SCOPE works on both relational and nonrelational data sources
EXTRACT and OUTPUT commands provide a relational abstraction of
underlying data sources

EXTRACT column[:<type>] [, ] FROM < input_stream(s) > USING <Extractor> [(args)] [HAVING <predicate>] OUTPUT [<input>] TO <output_stream> [USING <Outputter> [(args)]]
Built-in/customized extractors and outputters (C# classes)

public class LineitemExtractor : Extractor { public override Schema Produce(string[] requestedColumns, string[] args) {} public override IEnumerable<Row> Extract(StreamReader reader, Row outputRow, string[] args) {} }
Select and Join

SELECT [DISTINCT] [TOP count] select_expression [AS <name>] [, ] FROM { <input stream(s)> USING <Extractor> | {<input> [<joined input> []]} [, ] } [WHERE <predicate>] [GROUP BY <grouping_columns> [, ] ] [HAVING <predicate>] [ORDER BY <select_list_item> [ASC | DESC] [, ]] joined input: <join_type> JOIN <input> [ON <equijoin>] join_type: [INNER | {LEFT | RIGHT | FULL} OUTER]
Supports different Agg functions: COUNT, COUNTIF, MIN, MAX,
SUM, AVG, STDEV, VAR, FIRST, LAST. No subqueries (but same functionality available because of outer join)
Deep Integration with .NET (C#)

SCOPE supports C# expressions and built-in .NET functions/library
User-defined scalar expressions User-defined aggregation functions
R1 = SELECT A+C AS ac, B.Trim() AS B1 FROM R WHERE StringOccurs(C, xyz) > 2 #CS public static int StringOccurs(string str, string ptrn) {} #ENDCS
User Defined Operators

SCOPE supports three highly extensible commands: PROCESS,
REDUCE, and COMBINE

Complements SELECT for complicated analysis Easy to customize by extending built-in C# components Easy to reuse code in other SCOPE scripts
Process
PROCESS command takes a rowset as input, processes each row, and
outputs a sequence of rows

PROCESS [<input>] USING <Processor> [ (args) ] [PRODUCE column [, ]] [WHERE <predicate> ] [HAVING <predicate> ]
public class MyProcessor : Processor { public override Schema Produce(string[] requestedColumns, string[] args, Schema inputSchema) {} public override IEnumerable<Row> Process(RowSet input, Row outRow, string[] args) {} }
Reduce
REDUCE command takes a grouped rowset, processes each group, and
outputs zero, one, or multiple rows per group

REDUCE [<input> [PRESORT column [ASC|DESC] [, ]]] ON grouping_column [, ] USING <Reducer> [ (args) ] [PRODUCE column [, ]] [WHERE <predicate> ] [HAVING <predicate> ]
public class MyReducer : Reducer { public override Schema Produce(string[] requestedColumns, string[] args, Schema inputSchema) {}
public override IEnumerable<Row> Reduce(RowSet input, Row outRow, string[] args) {}

}
Map/Reduce can be easily expressed by Process/Reduce
Combine
COMBINE command takes two matching input rowsets, combines
them in some way, and outputs a sequence of rows

COMBINE <input1> [AS <alias1>] [PRESORT ] WITH <input2> [AS <alias2>] [PRESORT ] ON <equality_predicate> USING <Combiner> [ (args) ] PRODUCE column [, ] [HAVING <expression> ] COMBINE S1 WITH S2 ON S1.A==S2.A AND S1.B==S2.B AND S1.C==S2.C USING MyCombiner PRODUCE D, E, F
public class MyCombiner : Combiner { public override Schema Produce(string[] requestedColumns, string[] args, Schema leftSchema, string leftTable, Schema rightSchema, string rightTable) {} public override IEnumerable<Row> Combine(RowSet left, RowSet right, Row outputRow, string[] args) {} }
Importing Scripts
IMPORT <script_file> [PARAMS <par_name> = <value> [,]]
Combines the benefits of virtual views and stored procedures in SQL Enables modularity and information hiding Improves reusability and allows parameterization Provides a security mechanism
Q1 = IMPORT MyView.script PARAMS logfile=Queries_Jan.log, limit=1000;
E = EXTRACT query FROM @@logfile@@ USING LogExtractor ;
EXPORT R = SELECT query, COUNT() AS count FROM E GROUP BY query HAVING count > @@limit@@;
Q2 = IMPORT MyView.script PARAMS logfile=Queries_Feb.log, limit=1000;
Life of a SCOPE Query

Scope Queries
Parser / Compiler / Security
...
...
Optimizer and Runtime

Scope Queries
(Logical Operator Trees)
Logical Operators Cardinality Estimation Cost Estimat ion Physical operators
Optimization Rules
Transformation Engine
Optimal Query Plans
(Vertex DAG)
SCOPE optimizer Transformation-based optimizer Reasons about plan properties (partitioning, grouping, sorting, etc.) Chooses an optimal plan based on cost estimates
Vertex DAG: each vertex contains a pipeline of operators
SCOPE Runtime Provides a rich class of composable physical operators Operators are implemented using the iterator model Executes a series of operators in a pipelined fashion
Example Query Plan (QCount)

SELECT query, COUNT(*) AS count FROM search.log USING LogExtractor GROUP BY query HAVING count> 1000 ORDER BY count DESC; OUTPUT TO qcount.result
1. 2. 3. 4. 5. 6. 7. 8.
Extract the input cosmos file Partially aggregate at the rack level Partition on query Fully aggregate Apply filter on count Sort results in parallel Merge results Output as a cosmos file
TPC-H Query 2
// Extract region, nation, supplier, partsupp, part RNS_JOIN = SELECT s_suppkey, n_name FROM region, nation, supplier WHERE r_regionkey == n_regionkey AND n_nationkey == s_nationkey; RNSPS_JOIN = SELECT p_partkey, ps_supplycost, ps_suppkey, p_mfgr, n_name FROM part, partsupp, rns_join WHERE p_partkey == ps_partkey AND s_suppkey == ps_suppkey; SUBQ = SELECT p_partkey AS subq_partkey, MIN(ps_supplycost) AS min_cost FROM rnsps_join GROUP BY p_partkey; RESULT = SELECT s_acctbal, s_name, p_partkey, p_mfgr, s_address, s_phone, s_comment FROM rnsps_join AS lo, subq AS sq, supplier AS s WHERE lo.p_partkey == sq.subq_partkey AND lo.ps_supplycost == min_cost AND lo.ps_suppkey == s.s_suppkey ORDER BY acctbal DESC, n_name, s_name, partkey; OUTPUT RESULT TO "tpchQ2.tbl";
Sub Execution Plan to TPCH Q2

1. 2. 3. 4. 5. 6.
7. 8.
9.
Join on suppkey Partially aggregate at the rack level Partition on group-by column Fully aggregate Partition on partkey Merge corresponding partitions Partition on partkey Merge corresponding partitions Perform join
A Real Example
Conclusions
SCOPE: a new scripting language for large-scale analysis Strong resemblance to SQL: easy to learn and port existing applications Very extensible

Fully benefits from .NET library Supports built-in C# templates for customized operations Supports a rich class of physical operators Great reusability with views, user-defined operators Improves productivity Implementation details (including parallelism, system complexity) are transparent to users Allows sophisticated optimization Good foundation for performance study and improvement
Highly composable

High-level declarative language

Current/Future Work
Language enhancements Sharing, data mining, etc. Query optimization Auto tuning physical storage design Materialized view optimization Common subexpression exploitation Progressive query optimization Runtime optimization New execution strategies Self-adaptive dynamic query plans

Microsoft Scope

Uploaded by

Copyright:

Available Formats

Microsoft Scope

Uploaded by

Document Information

Original Description:

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Microsoft Scope

Uploaded by

Copyright:

Available Formats

Jingren Zhou

Large-scale Distributed Computing

Map function -> grouping Reduce function -> aggregation

Very rigid: every computation has to be structured as a

Pig Latin / Hadoop

Provides runtime optimization, fault

tolerance, resource management

ptimized for arallel xecution A declarative scripting language

Data model: a relational rowset with well-defined schema

Input and Output

underlying data sources

Built-in/customized extractors and outputters (C# classes)

Select and Join

Supports different Agg functions: COUNT, COUNTIF, MIN, MAX,

Deep Integration with .NET (C#)

User Defined Operators

REDUCE, and COMBINE

outputs a sequence of rows

outputs zero, one, or multiple rows per group

public override IEnumerable<Row> Reduce(RowSet input, Row outRow, string[] args) {}

Map/Reduce can be easily expressed by Process/Reduce

them in some way, and outputs a sequence of rows

E = EXTRACT query FROM @@logfile@@ USING LogExtractor ;

Q2 = IMPORT MyView.script PARAMS logfile=Queries_Feb.log, limit=1000;

Life of a SCOPE Query

Optimizer and Runtime

Vertex DAG: each vertex contains a pipeline of operators

Example Query Plan (QCount)

Sub Execution Plan to TPCH Q2

High-level declarative language

You might also like