
Commit f0123f9 (initial commit, 0 parents)

26 files changed, +1981 additions, 0 deletions

.gitignore

Lines changed: 3 additions & 0 deletions
@@ -0,0 +1,3 @@
**/target
**/dependency-reduced-pom.xml
**/.idea

README.md

Lines changed: 43 additions & 0 deletions
@@ -0,0 +1,43 @@
# Apache Flink® SQL Demo

**This repository provides a demo for Flink SQL.**

The demo shows how to:

* Set up Flink SQL with a Hive catalog.
* Use Flink SQL to prototype a query on a small CSV sample data set.
* Run the same query on a larger ORC data set.
* Run the same query as a continuous query on a Kafka topic.
* Run different streaming SQL queries, including pattern matching with `MATCH_RECOGNIZE`.
* Maintain a materialized view in MySQL.
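The `MATCH_RECOGNIZE` step above could look like the following sketch. The table and column names (`Fares`, `rideId`, `fare`, `rowTime`) are illustrative assumptions, not the demo's actual schema; the pattern finds rides whose fare increased twice in a row:

```sql
SELECT *
FROM Fares
MATCH_RECOGNIZE (
  PARTITION BY rideId
  ORDER BY rowTime          -- must be a time attribute
  MEASURES
    A.fare AS firstFare,
    C.fare AS thirdFare
  AFTER MATCH SKIP PAST LAST ROW
  PATTERN (A B C)           -- three consecutive fare events
  DEFINE
    B AS B.fare > A.fare,
    C AS C.fare > B.fare
)
```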
### Requirements

The demo is based on Flink's SQL CLI client and uses Docker Compose to set up the training environment.

You **only need [Docker](https://www.docker.com/)** to run this training.
## What is Apache Flink?

[Apache Flink](https://flink.apache.org) is a framework and distributed processing engine for stateful computations over unbounded and bounded data streams. Flink has been designed to run in all common cluster environments and to perform computations at in-memory speed and at any scale.

## What is SQL on Apache Flink?

Flink features multiple APIs with different levels of abstraction. SQL is supported by Flink as a unified API for batch and stream processing: queries are executed with the same semantics on unbounded, real-time streams and on bounded, recorded streams, and they produce the same results. SQL on Flink is commonly used to ease the definition of data analytics, data pipelining, and ETL applications.

The following example shows a SQL query that computes the number of departing taxi rides per hour.
```sql
SELECT
  TUMBLE_START(rowTime, INTERVAL '1' HOUR) AS t,
  COUNT(*) AS cnt
FROM Rides
WHERE
  isStart
GROUP BY
  TUMBLE(rowTime, INTERVAL '1' HOUR)
```
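The "materialized view in MySQL" item in the feature list could build directly on this query: a continuous `INSERT INTO` writes its result to a JDBC sink table, which Flink keeps up to date. The sink table name `RideCounts` is a hypothetical example, assuming a matching MySQL-backed table has been registered:

```sql
-- Illustrative: continuously maintain the hourly ride counts in MySQL.
-- Assumes a JDBC sink table RideCounts(t TIMESTAMP(3), cnt BIGINT) exists.
INSERT INTO RideCounts
SELECT
  TUMBLE_START(rowTime, INTERVAL '1' HOUR) AS t,
  COUNT(*) AS cnt
FROM Rides
WHERE isStart
GROUP BY TUMBLE(rowTime, INTERVAL '1' HOUR)
```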
----

*Apache Flink, Flink®, Apache®, the squirrel logo, and the Apache feather logo are either registered trademarks or trademarks of The Apache Software Foundation.*

build-image/Dockerfile

Lines changed: 69 additions & 0 deletions
@@ -0,0 +1,69 @@
###############################################################################
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
###############################################################################

###############################################################################
# Build UDF and Data Producer JARs
###############################################################################

FROM maven:3.6-jdk-8-slim AS builder

# Get UDF code and compile it
COPY ./java/sql-training-udfs /opt/sql-udfs
RUN cd /opt/sql-udfs; \
    mvn clean install

# Get data producer code and compile it
COPY ./java/sql-training-data-producer /opt/data-producer
RUN cd /opt/data-producer; \
    mvn clean install

###############################################################################
# Build SQL Playground Image
###############################################################################

FROM flink:1.10.0-scala_2.11

# FLINK_VERSION is referenced in the download step below but is not defined
# by the base image, so set it explicitly to match the base image's release.
ENV FLINK_VERSION 1.10.0

ADD VERSION .

# Copy sql-client configuration
COPY sql-client/ /opt/sql-client

# Copy playground UDFs
COPY --from=builder /opt/sql-udfs/target/sql-training-udfs-*.jar /opt/sql-client/lib/

# Copy data producer
COPY --from=builder /opt/data-producer/target/sql-training-data-producer-*.jar /opt/data/data-producer.jar

# Download connector libraries
RUN wget -P /opt/sql-client/lib/ https://repo.maven.apache.org/maven2/org/apache/flink/flink-json/${FLINK_VERSION}/flink-json-${FLINK_VERSION}.jar; \
    wget -P /opt/sql-client/lib/ https://repo.maven.apache.org/maven2/org/apache/flink/flink-sql-connector-kafka_2.11/${FLINK_VERSION}/flink-sql-connector-kafka_2.11-${FLINK_VERSION}.jar; \
    wget -P /opt/sql-client/lib/ https://repo.maven.apache.org/maven2/org/apache/flink/flink-jdbc_2.11/1.10.0/flink-jdbc_2.11-1.10.0.jar; \
    wget -P /opt/sql-client/lib/ https://repo.maven.apache.org/maven2/mysql/mysql-connector-java/8.0.19/mysql-connector-java-8.0.19.jar; \
    # Create data folders
    mkdir -p /opt/data; \
    mkdir -p /opt/data/stream; \
    # Download data files
    wget -O /opt/data/driverChanges.txt.gz 'https://drive.google.com/uc?export=download&id=1pf4tfv-YpoVQ9_O0948M8oXeCfVH-0MH'; \
    wget -O /opt/data/fares.txt.gz 'https://drive.google.com/uc?export=download&id=1SriiwcIdMvY7uJsWSY4Hhh32iO3F4ND2'; \
    wget -O /opt/data/rides.txt.gz 'https://drive.google.com/uc?export=download&id=1gY8W07OFvB7_4lHlAyingM4WQzs0_8lT';

# Copy configuration
COPY conf/* /opt/flink/conf/

WORKDIR /opt/sql-client
ENV SQL_CLIENT_HOME /opt/sql-client
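Since the Dockerfile copies `java/`, `sql-client/`, `conf/`, and `VERSION` into the image, it presumably must be built with the repository root as the build context. A minimal sketch of building and entering the image; the tag `flink-sql-demo` is a hypothetical name, not one the repository defines:

```shell
# Build the playground image from the repository root (tag is illustrative).
docker build -f build-image/Dockerfile -t flink-sql-demo .

# Start a container; WORKDIR drops the shell into /opt/sql-client.
docker run -it flink-sql-demo /bin/bash
```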
