From charlesreid1

No edit summary
No edit summary
 
(10 intermediate revisions by the same user not shown)
Line 1: Line 1:
Review in preparation for interview:
==Review Notes Pages==
* Components of workflow and open source tools for each step
* Highlight each step with a data engineering repository
* Individual services offered on the cloud - know the idea behind, e.g., why so many database solutions
* What specific challenges, software, workflows do genomics researchers face/use?


[[Google Cloud/Scientific Data Processing]] - doing the scientific data processing qwiklab


Process:
Review page: [[Google Cloud/Review]]


Case study
==Data Engineering Scenarios Review==
* Start by reviewing the logistics company case study
* https://charlesreid1.com/wiki/Google_Cloud/Case_Study


Software tools
Project 1: [[2018/January/Data Engineering/Scientific Data Processing]]
* Basic software technologies: storage, databases, distributed computation, GPUs vs CPUs, Docker/containerization
* https://charlesreid1.com/wiki/Google_Cloud
* Google Cloud Genomics


Software Quality Assurance
Project 2: [[2018/January/Data Engineering/Big Data Text Processing]]
* Github pages/10 things list (time machine)


GCDEC Review:
Project 3: [[2018/January/Data Engineering/Cosmos]]
* 1 - https://charlesreid1.com/wiki/GCDEC/Fundamentals/Notes
 
* 2 - https://charlesreid1.com/wiki/GCDEC/Unstructured_Data/Notes
==Flags==
* 3a - https://charlesreid1.com/wiki/GCDEC/BigQuery/Notes
 
* 3b - https://charlesreid1.com/wiki/GCDEC/Dataflow/Notes
[[Category:Google Cloud]]
* 4a - https://charlesreid1.com/wiki/GCDEC/Building_Tensorflow/Notes
[[Category:Data Engineering]]
* 4b - https://charlesreid1.com/wiki/GCDEC/Deploying_Tensorflow/Notes
* 4c - https://charlesreid1.com/wiki/GCDEC/Engineering_Tensorflow/Notes
* 5 - https://charlesreid1.com/w/index.php?title=GCDEC/Streaming/Notes&action=edit&redlink=1

Latest revision as of 23:25, 11 January 2018

Review Notes Pages

Google Cloud/Scientific Data Processing - doing the scientific data processing qwiklab

Review page: Google Cloud/Review

Data Engineering Scenarios Review

Project 1: 2018/January/Data Engineering/Scientific Data Processing

Project 2: 2018/January/Data Engineering/Big Data Text Processing

Project 3: 2018/January/Data Engineering/Cosmos

Flags