Department of Transport and Main Roads (TMR)
October 2019 - November 2019
CAVI (Cooperative and Automated Vehicle Initiative) project is being developed by TMR (Transport and Main Roads) which mainly focuses on new vehicle technologies including safety mobility and environmental benefits onQueensland Road. Their Goal is to test cooperative and automated vehicle technologies that make roads safer by contributing towards a vision of zero road deaths and serious injuries on the state’s roads.
Cloudten was involved to assist TMR with the Industry best practices and to review their current Data analytics pipeline and their large datastore
As part of the engagement Cloudten provided the following,
As part of this engagement, Cloudten. Successfully delivered the proposed solution. CAVI account now has an ETL job that is quicker efficient and is run with minimal time. Data streaming was enabled for some of the services from source to their ETL Tools for further processing. Major fixes in the legacy code.
DescribeTCO Analysis Performed:
Analysed current state of run costs on Glue Crawlers and Jobs
Glue Crawler takes a longer time and it crawls entire data set causing overhead and additional costs, this was mitigated by moving Glue crawler functionality within AWS Glue Jobs.
Glue Data CatLog – Glue Crawlers were used to populate AWS Glue data CatLog in the tables
Glue jobs – Transform data from one form to another; CSV toParquet
Glue database – As Datastore
DynamoDB -- Stores table Schema
Glue Endpoints – Connect with the local Zeppelin notebooks for debugging.
Athena – Create queries and views for Tableau
S3 - Data lake
IAM – Create restricted policies and Roles to be consumed by AWSServices and end users
CloudFormation – Deploy the pipeline including Glue jobs, crawlers, Lambda Functions, S3buckets
Code Commit-- StoreCode for CloudFormation templates, JSON files for DynamoDB schema, LambdaFunctions
Lambda – Functions to validate Schema, copy files from Raw toStaging
Step Function – Drives lambda function