From the course: Apache Iceberg for Data Analytics and Machine Learning

Unlock this course with a free trial

Join today to access over 24,300 courses taught by industry experts.

Setting up your project

Setting up your project

- We're going to start by setting up our environment. We'll need to do a few things. First, navigate to github.com\andymadson\linkedinlearning. Here you'll find the files that you need for today's exercises. Let me quickly tell you what each file does and why we need it. You have a Read Me, which just states that these are the files for this course. We have our Docker composed file, which is going to launch the services within Docker that we're going to use, such as Dremio and Jupyter Notebooks. We have some SQL code to clean up the files and create some views. And we have a Jupyter Notebook where you can follow along and use the code in there to do your analytics and machine learning. Let's get started. Start by opening up VSCode, creating a new empty folder. Mine is called DREMIO DEMO. And we need to create our Docker-Compose YAML file. Copy and paste the contents from the GitHub Docker-Compose file into our VSCode. I'm going to use the right click Compose Up feature within VSCode…

Contents