Dev #2

Open
wants to merge 76 commits into
base: main
ea20e08
Adding dataset: azureSynapseTripTable
vishalghelani Jul 16, 2021
4b7d2f3
Renaming pipeline: TripFaresDataPipeline as PrepareDataPipeline
vishalghelani Jul 16, 2021
68a78bd
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
ebc8fcf
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
3a45f07
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
734d97b
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
7467925
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
716e023
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
4505f1f
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
daf1e47
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
fdc1cca
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
058b1ff
Adding linkedService: CosmosDb
vishalghelani Jul 16, 2021
e825bb2
Adding dataset: faresFileSink
vishalghelani Jul 16, 2021
6a911a6
Updating dataset: faresFileSink
vishalghelani Jul 16, 2021
b28e0c2
Adding dataset: faresFileSource
vishalghelani Jul 16, 2021
fda4fa1
Updating dataset: faresFileSource
vishalghelani Jul 16, 2021
998f2fb
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
34ad85d
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
c7dc4b5
Adding dataset: tripFileSink
vishalghelani Jul 16, 2021
40ebbb0
Adding dataset: tripsFileSource
vishalghelani Jul 16, 2021
62226d6
Updating dataset: tripsFileSource
vishalghelani Jul 16, 2021
53c32e5
Deleting dataset: faresFileSource
vishalghelani Jul 16, 2021
4a0c08a
Adding dataset: faresFileSource
vishalghelani Jul 16, 2021
67336ff
Updating dataset: faresFileSource
vishalghelani Jul 16, 2021
0f1842d
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
7d66d74
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
46ca568
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
f53e1e9
Updating dataset: tripFileSink
vishalghelani Jul 16, 2021
e688931
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
c43e3b4
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
e069c57
Updating dataset: faresFileSource
vishalghelani Jul 16, 2021
e7e984c
Updating dataset: tripsFileSource
vishalghelani Jul 16, 2021
752c0ba
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
225ffa5
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
4e91a9f
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
a8ff824
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
605474a
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
ec3599d
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
f3fa5e8
Updating dataset: tripsDataSource
vishalghelani Jul 16, 2021
d199975
Updating dataset: faresDataSource
vishalghelani Jul 16, 2021
65f889a
Updating dataset: tripsDataSource
vishalghelani Jul 16, 2021
97802aa
Updating dataset: CosmosDbSqlApiCollection
vishalghelani Jul 16, 2021
204d4d3
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
1b6570a
Updating pipeline: PrepareDataPipeline
vishalghelani Jul 16, 2021
74935ef
Updating dataset: faresFileSource
vishalghelani Jul 16, 2021
323129b
Updating dataset: CosmosDbSqlApiCollection
vishalghelani Jul 16, 2021
5174555
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
a24e1f2
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
09ec3ac
Adding integrationRuntime: IntegrationRuntime2
vishalghelani Jul 16, 2021
d75d1e8
Updating pipeline: PrepareDataPipeline
vishalghelani Jul 16, 2021
d167a27
Adding linkedService: PowerBILinkService
vishalghelani Jul 16, 2021
eca6974
Adding dataset: ReportDataSink
vishalghelani Jul 16, 2021
d1b9fcd
Updating dataset: ReportDataSink
vishalghelani Jul 16, 2021
d3a0801
Updating pipeline: PrepareDataPipeline
vishalghelani Jul 16, 2021
9fb5e27
Updating pipeline: PrepareDataPipeline
vishalghelani Jul 16, 2021
89e7f1a
Updating pipeline: IngestionPipeline
vishalghelani Jul 16, 2021
a4d0607
Updating pipeline: PrepareDataPipeline
vishalghelani Jul 16, 2021
01449de
Updating dataset: ReportDataSink
vishalghelani Jul 16, 2021
5d30e14
Deleting integrationRuntime: IntegrationRuntime2
vishalghelani Jul 16, 2021
b437c55
Adding pipeline: IngestionParquet
vishalghelani Jul 16, 2021
eefe2fe
Adding dataset: tripFileSink_parquet
vishalghelani Jul 16, 2021
61cc06c
Deleting dataset: tripFileSink_parquet
vishalghelani Jul 16, 2021
1846e4a
Adding dataset: TripParquetFile
vishalghelani Jul 16, 2021
cb515df
Renaming dataset: TripParquetFile as tripParquetFile
vishalghelani Jul 16, 2021
c3c61a8
Updating dataset: tripParquetFile
vishalghelani Jul 16, 2021
d9317d0
Updating dataset: tripParquetFile
vishalghelani Jul 16, 2021
6469e2a
Updating dataset: tripParquetFile
vishalghelani Jul 16, 2021
050c9e0
Updating pipeline: IngestionParquet
vishalghelani Jul 16, 2021
f1e8749
Updating dataset: tripParquetFile
vishalghelani Jul 16, 2021
2720a17
Updating pipeline: IngestionParquet
vishalghelani Jul 16, 2021
ac253b0
Deleting pipeline: IngestionParquet
vishalghelani Jul 16, 2021
0a3fa18
Deleting dataset: ReportDataSink
vishalghelani Jul 16, 2021
2fece97
Adding dataset: tripReport
vishalghelani Jul 16, 2021
09bdca1
Updating pipeline: PrepareDataPipeline
vishalghelani Jul 16, 2021
0f7c8e7
Deleting dataset: tripParquetFile
vishalghelani Jul 16, 2021
00f5bbf
Adding sqlscript: SQL script 1
vishalghelani Jul 16, 2021
15 changes: 15 additions & 0 deletions synapsepoc/dataset/CosmosDbSqlApiCollection.json
@@ -0,0 +1,15 @@
{
"name": "CosmosDbSqlApiCollection",
"properties": {
"linkedServiceName": {
"referenceName": "CosmosDb",
"type": "LinkedServiceReference"
},
"annotations": [],
"type": "CosmosDbSqlApiCollection",
"schema": {},
"typeProperties": {
"collectionName": "fares"
}
}
}
57 changes: 57 additions & 0 deletions synapsepoc/dataset/azureSynapseTripTable.json
@@ -0,0 +1,57 @@
{
"name": "azureSynapseTripTable",
"properties": {
"linkedServiceName": {
"referenceName": "TripFaresSynapseAnalyticsLinkedService",
"type": "LinkedServiceReference",
"parameters": {
"SynapseWorkspaceName": {
"value": "@dataset().SynapseWorkspaceName",
"type": "Expression"
},
"SQLDedicatedPoolName": {
"value": "@dataset().SQLDedicatedPoolName",
"type": "Expression"
},
"keyVaultName": {
"value": "@dataset().keyVaultName",
"type": "Expression"
},
"SQLLoginUsername": {
"value": "@dataset().SQLLoginUsername",
"type": "Expression"
}
}
},
"parameters": {
"SchemaName": {
"type": "string"
},
"SynapseWorkspaceName": {
"type": "string"
},
"SQLDedicatedPoolName": {
"type": "string"
},
"keyVaultName": {
"type": "string"
},
"SQLLoginUsername": {
"type": "string"
}
},
"folder": {
"name": "TripFareDatasets"
},
"annotations": [],
"type": "AzureSqlDWTable",
"schema": [],
"typeProperties": {
"schema": {
"value": "@dataset().SchemaName",
"type": "Expression"
},
"table": "Trips"
}
}
}
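The `azureSynapseTripTable` dataset above forwards its own parameters to the linked service through `@dataset().<name>` expressions. The following is a minimal sketch, in Python, of a check that every such reference has a matching entry under `parameters` and that no declared parameter goes unused. The helper names `collect_refs` and `check_dataset_parameters` are hypothetical and not part of any Azure SDK, and the regular expression only covers the simple `@dataset().Name` form that appears in this diff.

```python
import re

# Matches references such as "@dataset().SchemaName" inside expression strings.
# Assumption: only this simple form is used, as in the datasets in this PR.
DATASET_REF = re.compile(r"@dataset\(\)\.([A-Za-z_][A-Za-z0-9_]*)")


def collect_refs(node):
    """Recursively collect parameter names referenced via @dataset().<name>."""
    refs = set()
    if isinstance(node, dict):
        for value in node.values():
            refs |= collect_refs(value)
    elif isinstance(node, list):
        for item in node:
            refs |= collect_refs(item)
    elif isinstance(node, str):
        refs |= set(DATASET_REF.findall(node))
    return refs


def check_dataset_parameters(dataset):
    """Return (undeclared, unused) parameter names for one dataset definition.

    undeclared: referenced through @dataset() but missing from "parameters".
    unused: declared under "parameters" but never referenced.
    """
    props = dataset.get("properties", {})
    declared = set(props.get("parameters", {}))
    referenced = collect_refs(props)
    return sorted(referenced - declared), sorted(declared - referenced)
```

Loading a dataset file with `json.load` and passing the result to `check_dataset_parameters` yields two sorted lists. Both are empty when the declarations and references line up, which appears to be the case for the five parameters in `azureSynapseTripTable.json`.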
60 changes: 56 additions & 4 deletions synapsepoc/dataset/faresDataSource.json
@@ -2,8 +2,12 @@
"name": "faresDataSource",
"properties": {
"linkedServiceName": {
- "referenceName": "HttpServerTripFareDataLinkedService",
- "type": "LinkedServiceReference"
+ "referenceName": "TripFaresDataLakeStorageLinkedService",
+ "type": "LinkedServiceReference",
+ "parameters": {
+ "keyVaultName": "kvdemomukdgps4msibmpoc",
+ "datalakeAccountName": "demomukdgps4msibmpoc"
+ }
},
"folder": {
"name": "TripFareDatasets"
@@ -12,13 +16,61 @@
"type": "DelimitedText",
"typeProperties": {
"location": {
- "type": "HttpServerLocation"
+ "type": "AzureBlobFSLocation",
+ "fileName": "fares-data.csv",
+ "folderPath": "analytics-data",
+ "fileSystem": "public"
},
"columnDelimiter": ",",
"escapeChar": "\\",
"firstRowAsHeader": true,
"quoteChar": "\""
},
- "schema": []
+ "schema": [
+ {
+ "name": "medallion",
+ "type": "String"
+ },
+ {
+ "name": "hack_license",
+ "type": "String"
+ },
+ {
+ "name": "vendor_id",
+ "type": "String"
+ },
+ {
+ "name": "pickup_datetime",
+ "type": "String"
+ },
+ {
+ "name": "payment_type",
+ "type": "String"
+ },
+ {
+ "name": "fare_amount",
+ "type": "String"
+ },
+ {
+ "name": "surcharge",
+ "type": "String"
+ },
+ {
+ "name": "mta_tax",
+ "type": "String"
+ },
+ {
+ "name": "tip_amount",
+ "type": "String"
+ },
+ {
+ "name": "tolls_amount",
+ "type": "String"
+ },
+ {
+ "name": "total_amount",
+ "type": "String"
+ }
+ ]
}
}
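In this change `faresDataSource` passes `keyVaultName` and `datalakeAccountName` to the linked service as plain string literals, whereas `faresFileSink` passes the same two values as `@dataset()` expressions. Whether the literals are intentional for this proof of concept is not stated in the PR. As a rough sketch, a Python helper along these lines could list such literals so they can be reviewed. The name `literal_linked_service_params` is hypothetical, and the rule that "a dict with `type` equal to `Expression`" marks a parameterized value is inferred from the files in this diff.

```python
def literal_linked_service_params(dataset):
    """Return names of linked-service parameters that are given as literals.

    A value counts as parameterized only when it is an object whose "type"
    is "Expression" (the shape used by faresFileSink in this PR).
    """
    params = (
        dataset.get("properties", {})
        .get("linkedServiceName", {})
        .get("parameters", {})
    )
    literals = []
    for name, value in params.items():
        is_expression = isinstance(value, dict) and value.get("type") == "Expression"
        if not is_expression:
            literals.append(name)
    return sorted(literals)
```

A dataset shaped like `faresDataSource` would return both parameter names, while one shaped like `faresFileSink` would return an empty list.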
90 changes: 90 additions & 0 deletions synapsepoc/dataset/faresFileSink.json
@@ -0,0 +1,90 @@
{
"name": "faresFileSink",
"properties": {
"linkedServiceName": {
"referenceName": "TripFaresDataLakeStorageLinkedService",
"type": "LinkedServiceReference",
"parameters": {
"keyVaultName": {
"value": "@dataset().keyVaultName",
"type": "Expression"
},
"datalakeAccountName": {
"value": "@dataset().datalakeAccountName",
"type": "Expression"
}
}
},
"parameters": {
"keyVaultName": {
"type": "string"
},
"datalakeAccountName": {
"type": "string"
}
},
"folder": {
"name": "TripFareDatasets"
},
"annotations": [],
"type": "DelimitedText",
"typeProperties": {
"location": {
"type": "AzureBlobFSLocation",
"fileName": "fares-data.csv",
"folderPath": "analytics-data",
"fileSystem": "public"
},
"columnDelimiter": ",",
"escapeChar": "\\",
"firstRowAsHeader": true,
"quoteChar": "\""
},
"schema": [
{
"name": "medallion",
"type": "String"
},
{
"name": "hack_license",
"type": "String"
},
{
"name": "vendor_id",
"type": "String"
},
{
"name": "pickup_datetime",
"type": "String"
},
{
"name": "payment_type",
"type": "String"
},
{
"name": "fare_amount",
"type": "String"
},
{
"name": "surcharge",
"type": "String"
},
{
"name": "mta_tax",
"type": "String"
},
{
"name": "tip_amount",
"type": "String"
},
{
"name": "tolls_amount",
"type": "String"
},
{
"name": "total_amount",
"type": "String"
}
]
}
}
67 changes: 67 additions & 0 deletions synapsepoc/dataset/faresFileSource.json
@@ -0,0 +1,67 @@
{
"name": "faresFileSource",
"properties": {
"linkedServiceName": {
"referenceName": "TripFaresDataLakeStorageLinkedService",
"type": "LinkedServiceReference",
"parameters": {
"keyVaultName": "kvdemomukdgps4msibmpoc",
"datalakeAccountName": "demomukdgps4msibmpoc"
}
},
"folder": {
"name": "TripFareDatasets"
},
"annotations": [],
"type": "Json",
"typeProperties": {
"location": {
"type": "AzureBlobFSLocation",
"fileName": "fares-data.json",
"folderPath": "raw-data",
"fileSystem": "public"
}
},
"schema": {
"type": "object",
"properties": {
"FareID": {
"type": "integer"
},
"medallion": {
"type": "string"
},
"hack_license": {
"type": "string"
},
"vendor_id": {
"type": "string"
},
"pickup_datetime": {
"type": "string"
},
"payment_type": {
"type": "string"
},
"fare_amount": {
"type": "number"
},
"surcharge": {
"type": "number"
},
"mta_tax": {
"type": "number"
},
"tip_amount": {
"type": "number"
},
"tolls_amount": {
"type": "number"
},
"total_amount": {
"type": "number"
}
}
}
}
}
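The JSON schema in `faresFileSource` declares a `FareID` property and numeric types for the amount fields, while the delimited-text schema in `faresFileSink` lists eleven `String` columns and has no `FareID`. The pipeline mapping between the two is not part of the files shown here, so the sketch below only compares column names. It is a hedged Python illustration, and `schema_column_diff` is a hypothetical helper name.

```python
def schema_column_diff(json_schema, tabular_schema):
    """Compare a JSON-object schema with a tabular (name/type list) schema.

    Returns (only_in_source, only_in_sink) as sorted lists of column names.
    """
    source_cols = set(json_schema.get("properties", {}))
    sink_cols = {column["name"] for column in tabular_schema}
    return sorted(source_cols - sink_cols), sorted(sink_cols - source_cols)
```

Called with the `schema` object from `faresFileSource.json` and the `schema` array from `faresFileSink.json`, the first list would name the columns the sink does not carry and the second would name any sink columns missing from the source.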