Tag: Data Science

AI Platform Cloud SQL Data Science Sept. 28, 2020

Accessing Cloud SQL Data from AI Platform using Python - This article talks about a workaround to access data in Cloud SQL DB from the AI Platform.

BigQuery Data Science GIS Sept. 21, 2020

A beginner’s Guide to Google’s BigQuery GIS - Get started free with Google Big Query GIS with this step by step tutorial.

Big Data BigQuery Data Science Aug. 31, 2020

Google Cloud for Genomics - Building a scalable, reproducible, and secure data processing pipeline on the cloud.

Data Science Aug. 31, 2020

How I passed the Google Professional Data Engineer Exam in 2020 - In 8 days. Quick learner’s guide for those who don’t have time to read the manuals. August 2020.

Data Science Machine Learning Aug. 31, 2020

Managing Your Machine Learning Experiments with MLflow - Deploying MLflow server on GCP.

Data Science Machine Learning Aug. 17, 2020

Scalable Machine Learning with Dask on Google Cloud - A great addition to your arsenal of data science tools, Dask provides you advanced parallelism for computation at scale.

BigQuery Data Science Aug. 10, 2020

Yet another way to generate fake datasets in BigQuery - Wrapping faker.js with a Javascript UDF.

BigQuery Data Science Public Datasets July 27, 2020

Data Science 101 for Startups- Aggregation in SQL — Part 2 - Using aggregation SQL functions on BigQuery public dataset.

Cloud Dataproc Data Science Jupyter Notebook Tutorial July 27, 2020

Getting Started with Jupyter + Spark on the Cloud in 2020 - Spinning up Spark clusters with Jupyter on Cloud Dataproc.

Data Science Machine Learning July 27, 2020

Building a Data Platform to Enable Analytics and AI-Driven Innovation - Build a Data Mesh & Set up MLOps.

BigQuery Data Science Python July 20, 2020

BigQuery + Python for Production Data Science - Accessing BigQuery using Pandas, PySpark, and OS/Python.

BigQuery Data Science Public Datasets July 20, 2020

Data Science 101 for Startups- Aggregation in SQL - Aggregations concepts on examples from BigQuery.

BigQuery Data Science Machine Learning Public Datasets July 13, 2020

Stack Overflow in 2023: Predicting with ARIMA and BigQuery - Predicting the top Stack Overflow tags with ARIMA model in BigQuery.

Data Science Machine Learning Tutorial July 13, 2020

Building Image Detection with Google Cloud AutoML - Building "snack classifier" with AutoML Vision.

BigQuery Data Science July 6, 2020

Get started with BigQuery and dbt, the easy way - Find here the quickest way to get started with dbt and BigQuery using only free offerings from Google Cloud.

AI Platform Data Science July 6, 2020

Using GCP’s AI Platform to Predict Customer Churn - Developing a classification model to address customer churn.

BigQuery Cloud Functions Data Science Python July 6, 2020

Part 2: Building a Simple ETL Pipeline with Python and Google Cloud Functions — MySQL to BigQuery - Extracting data from a MySQL database and loading into Google BigQuery using Google Cloud Functions.

BigQuery Data Science Machine Learning July 6, 2020

Visualizing Pitcher Clusters: A Next OnAir Digital Experience - Analyzing baseball pitchers.

BigQuery Data Science July 3, 2020

How to handle Google Analytics data in BigQuery - The ways & tricks to tackle Shaded Tables and ARRAYs in BigQuery tables.

Data Science Machine Learning Python TensorFlow July 3, 2020

Model with TensorFlow and Serve on Google Cloud Platform - Serving TensorFlow Models on a scalable cloud platform.

BigQuery Data Science June 29, 2020

BigQuery: Creating Nested Data with SQL - Working with SQL on nested data in BigQuery can be very performant. But what if your data comes in flat tables like CSV’s?

BigQuery Data Science June 29, 2020

Easy pivot() in BigQuery, finally - Using dynamic SQL and stored procedures to pivot in BigQuery.

BigQuery Data Science June 29, 2020

Custom cohort size using Range Bucket in SQL. - Using RANGE_BUCKET command in BigQuery.

BigQuery Data Science Public Datasets June 15, 2020

Intro to BigQuery and its Free Data Sets - A quick introduction on how to access and query Google’s BigQuery using their free public datasets.

BigQuery Data Science June 8, 2020

Zero to Differential Privacy in 5 minutes on Google BigQuery - Differential Privacy presents a framework for asking statistical questions about a dataset while provably maintaining the privacy of the entities within that dataset.

BigQuery Data Science June 1, 2020

The Best Way to Generate Indices in BigQuery - Using GENERATE_ARRAY for Histograms and More.

AI Platform Notebooks Big Data Data Science Machine Learning June 1, 2020

Hands-on Big Data Analysis on GCP Using AI Platform Notebooks - Example of working with AI Platform Notebooks.

Cloud Composer Compute Engine Data Science May 18, 2020

Airflow on GCP (May 2020) - This is a complete guide to install Apache Airflow on a Google Cloud Platform Virtual Machine from scratch.

Big Data Data Catalog Data Science May 18, 2020

Google Cloud Data Catalog — Integrate Your On-Prem RDBMS Metadata - Code samples with a practical approach on how to ingest metadata from on-premise Relational Databases into Google Cloud Data Catalog.

Data Science Machine Learning Serverless May 11, 2020

13 Most Common Google Cloud Reference Architectures - Summary of #13DaysOfGCP architecture Twitter series.

BigQuery Data Science April 27, 2020

How to UNPIVOT multiple columns into tidy pairs with SQL and BigQuery - This post is for anyone dealing with time series in CSVs with one new column for each day.

BigQuery Data Science Data Studio Visualization April 27, 2020

Empowering Apple Mobility Trends Reports with BigQuery and Data Studio - Analyzing Apple's mobility data using BigQuery and Data Studio.

BigQuery Cloud Dataproc Data Science Jupyter Notebook March 16, 2020

Apache Spark and Jupyter Notebooks made easy with Dataproc component gateway - Make use of the new Dataproc optional components and component gateway features to easily use Jupyter Notebooks.

BigQuery Data Science Public Datasets March 16, 2020

Data analysis with SQL and BigQuery on New york city bikes data. - Starting with New York biking open data analysis.

Data Science Jupyter Notebook Machine Learning March 16, 2020

Setting Up Jupyter on Google Cloud - A scriptable list of command lines to deploy Jupyter in Google Cloud, securely and cost-effectively, with added exercises.

Beginner Cloud Composer Cloud Dataproc Data Science March 9, 2020

A gentle introduction to Data Workflows with Apache Airflow and Apache Spark - A tutorial on using Cloud Composer (Airflow) to launch Spark jobs on Cloud Dataproc.

AI Platform AI Platform Notebooks Data Science March 9, 2020

Reducing Startup Time For Notebooks With Custom Containers - Have you ever tried to use Cloud AI Platform Notebooks with huge containers?

BigQuery Data Science March 2, 2020

What do party schools and energy efficiency have in common? - Using BigQuery to analyze public data on building energy use.

AI Platform Data Science Docker Machine Learning Python March 2, 2020

Serverless machine learning using Docker - Running containers in Google AI Platform.

Data Science Serverless March 2, 2020

Introducing Serverless Orchestration with Houston - Serverless workflow control on Google Cloud Platform.

BigQuery Data Science Data Studio Feb. 24, 2020

Reddit AmItheAsshole is nicer to women than to men — a SQL proof? - Analyzing Reddit posts with BigQuery and visualizing in Data Studio.

Compute Engine Data Science Feb. 24, 2020

Jupyter Notebook on Google Compute Engine with HTTPS - Setting up Jupyter to run on Google Compute Engine and be accessed via HTTPS.

Data Science Machine Learning Feb. 17, 2020

All things GCP: Machine Learning Decision pyramid - Understand which Google Cloud tools matches best for you.

Apache Beam BigQuery Data Science Jan. 27, 2020

Fastai batch prediction on a BigQuery table - From this article, you will get to know how to perform a batch prediction on a BigQuery table using a fastai model.

BigQuery Data Science Data Studio Jan. 27, 2020

Interactive: The top 2019 Wikipedia pages - Going deeper into Wikipedia most popular pages for 2019 with BigQuery and Data Studio.

BigQuery Data Science Jan. 27, 2020

Inequality: How to draw a Lorenz curve with SQL, BigQuery, and Data Studio - Analyzing the popularity of Wikipedia pages based on public data set.

AI Platform Data Science Machine Learning Python Jan. 20, 2020

Using Scikit-learn on Google Cloud Platform - Training Scikit-learn models on GCP’s AI Platform.

Data Science Dec. 16, 2019

This is how you put the data in Data Science! - Google's search engine for Datasets.

BigQuery Data Science Dec. 9, 2019

Advent of code: SQL + BigQuery - Solving the Advent of Code challenges with SQL and BigQuery.

Data Science Machine Learning Dec. 2, 2019

Get started or improve your Machine Learning of structured data using AutoML Tables! (Part 1) - Challenges we are trying to solve and part 2 will go…

AI Platform Data Science Machine Learning Python Nov. 25, 2019

Predicting Taxi fares in NYC using Google Cloud AI Platform (Billion + rows) Part 3 - The objective of this series of articles is to create a Machine Learning model that is able to estimate taxi fares in NYC before the ride commences.

Big Data BigQuery Data Science GCP Experience Nov. 18, 2019

Batch Processing Pipelines for Better Data Analysis - An overview of how Gojek is using batch processing to generate useful insights from our data warehouse.

Big Data BigQuery Data Science Nov. 18, 2019

BigQuery workflow from the Jupyter notebook - In this article, you will get to know how to create and schedule the BigQuery workflow using the Jupyter Lab and the Cloud Composer.

App Engine BigQuery Data Science Python Nov. 18, 2019

Python / Pandas & BigQuery in 7 minutes - Using BigQuery in Django app.

Data Catalog Data Science Nov. 18, 2019

Boosting the Data Governance journey with Google Cloud Data Catalog - Thoughts on data discovery and metadata management in Google Cloud.

Data Science Kubernetes Machine Learning Nov. 18, 2019

MiniKF is now available on the GCP Marketplace - MiniKF is the fastest and easiest way to get started with Kubeflow. With just a few clicks, you are ready for experimentation, and for running complete Kubeflow Pipelines.

BigQuery Data Science Nov. 11, 2019

Anomaly Detection With SQL - Demonstrating SQL anomaly detection on a public dataset in BigQuery.

BigQuery Data Science Machine Learning Nov. 11, 2019

ML Design Pattern #5: Repeatable sampling - Use a well-distributed column to split your data into a train/valid/test.

BigQuery Data Science Data Studio Nov. 4, 2019

Analyzing the crisis with reddit and BigQuery: 2019 Chilean protests - Analyzing and visualizing data from Reddit with BigQuery and Data Studio.

Big Data BigQuery Data Science Nov. 4, 2019

Let the kids into the library - An opinionated attempt at building a data driven company in the cloud.

Beginner Data Science Machine Learning Nov. 4, 2019

Using a cluster in the cloud for Data Science projects in 4 simple steps - Tutorial on how to set up Jupyter notebook on GCP.

Big Data BigQuery Data Science Python Oct. 28, 2019

How to get into BigQuery analysis on Kaggle with Python? - Exploring ways to use BigQuery in Kaggle.

Big Data Data Science Oct. 28, 2019

A gentle introduction to Apache Druid in Google Cloud Platform - The article describes how to set up and use Apache Druid on GCP.

Data Science Machine Learning TensorFlow Oct. 28, 2019

Predicting Taxi fares in NYC using Google Cloud AI Platform (Billion + rows) Part 2 - Using data from BigQuery to create a Tensorflow model of predicting taxi fares in NYC.

BigQuery Data Science Sept. 30, 2019

10 top tips: Unleash your BigQuery superpowers - If BigQuery was superhero, what kind of superpowers would it have?

BigQuery Data Science Sept. 16, 2019

Loading MySQL backup files into BigQuery — straight from Cloud SQL - Loading Cloud SQL MySQL backup data into BigQuery.

BigQuery Data Science Python Sept. 2, 2019

Slow BigQuery results no more - How the use of BigQuery Storage API improves the speed of results retrieving from BigQuery.

BigQuery Data Science Data Studio Aug. 19, 2019

Don’t Double Park in Brooklyn - Analyzing New York's open data about state vehicle registration using BigQuery and Data Studio.

AI Data Science Machine Learning Aug. 19, 2019

How to Upgrade Colab with More Compute - Learn how to use Google Cloud Platform’s Deep Learning VMs to power up your Colab environment, on this episode of AI Adventures

Data Science Aug. 12, 2019

4 Data Studio tricks - UX and UI tips for Data Studio.

BigQuery Data Science July 29, 2019

BigQuery: SQL on Nested Data - Examples of working with nested data in BigQuery.

BigQuery Data Science Machine Learning July 29, 2019

Clustering 4,000 Stack Overflow tags with BigQuery k-means - Using BigQuery ML to cluster tags from StackOverflow.

Big Data BigQuery Data Science Java July 15, 2019

Beast: Moving Data from Kafka to BigQuery - GOJEK’s open source solution for moving data from Kafka to Google BigQuery.

Data Science DevOps Kubernetes Machine Learning July 15, 2019

Automated Model Retraining with Kubeflow Pipelines - How to implement a reproducible ML workflow that adapts to new data

BigQuery Data Science July 8, 2019

New in BigQuery: Persistent UDFs - Using new functionality of saving User Defined Functions in BigQuery.

BigQuery Data Science Python July 8, 2019

BigQuery and Public Datasets. Overview for Data Analysts - In this article we’ll briefly explore what is BigQuery and how a data analyst can access and use it through various interfaces with…

BigQuery Data Science July 8, 2019

An open source Python package for moving HelpScout data into Google BigQuery - This article is written for business analysts, data scientists and engineers that need to integrate Help Scout data into their Google BigQuery pipeline, and have hands-on experience dealing with Python, APIs and SQL databases.

Big Data Data Analytics Data Catalog Data Science July 8, 2019

Google Cloud Data Catalog hands-on guide: templates & tags with Python - This quickstart guide brings a practitioner approach to Data Catalog, covering Templates & Tags management using the Python client library.

BigQuery Data Science Data Studio June 24, 2019

From College to the Pros with Google Cloud Platform (Part 1) - Getting together and analyzing NBA players stats.

Data Science GCP Certification June 24, 2019

10 Days to Become a Google Cloud Certified Professional Data Engineer - Overview of resources used for Data Engineer exam preparation.

Data Science R June 24, 2019

From College to the Pros with Google Cloud Platform (Part 2) - The second part of NBA players analysis.

Cloud Dataproc Data Science June 17, 2019

Scale out RAPIDS on Google Cloud Dataproc - Scaling GPU data jobs on Cloud Dataproc.

BigQuery Data Science GCP Experience June 17, 2019

Analytics at lightspeed with Google BigQuery - The article describes how Aditya Birla Group created a digital platform on GCP to manage the travel of their employees.

Data Science June 17, 2019

Setup Julia with Jupyter notebook on Google Cloud Platform - Tutorial on how to set up and use Julia on Jupyter notebooks hosted on GCP.

Data Science Security June 10, 2019

How to use cloud storage to securely load data into Neo4j - Methods for loading data into a remote Neo4j Instance — Part 2

Apache Beam Cloud Dataflow Data Science Python May 13, 2019

Let’s Build a Streaming Data Pipeline - Creating Apache Beam / DataFlow pipeline to parse web server logs.

Data Science GCP Certification May 13, 2019

Passing the (new) Google Professional Data Engineer exam within 7 weeks - Experience of preparing and taking Data Engineer certification.

Data Science April 15, 2019

How to get started with Google Colab and Kaggle - Example of using Colab for Kaggle competitions.

AI Data Science Machine Learning April 8, 2019

GCP Notebook Executor v0.1.2 - Executing long running Jupyter Notebook jobs on GCP.

BigQuery Cloud Dataflow Cloud Dataprep Data Science Machine Learning TensorFlow April 8, 2019

End-to-end churn prediction on Google Cloud Platform - Overview of GCP architecture to build customer churn prediction compromising of data acquisition, data wrangling, modeling, model deployment, and a business use case.

 

Latest Issues




Contact

Zdenko Hrček
Třebanická 183
Prague, Czech Republic
Phone: +420 777 283 075
Email: zdenko@gcpweekly.com