Tag: Data Science

API BigQuery Data Science Jan. 18, 2021

Read/Write From Any Google API To/From BigQuery In 1 Minute Using BQ Flow - Use BQ Flow to transfer data between any Google API (Campaign Manager, Adwords API, Display Video) and.

Big Data BigQuery Data Science Jan. 18, 2021

BigQuery Hack: 1000x More Efficient Aggregation Using Materialized View - Learn how to supercharge your aggregation queries using Materialized View.

BigQuery Cloud AutoML Data Science Machine Learning Jan. 18, 2021

Comparing Custom Model Development With GCP BQML and AutoML Tables - Comparing Custom Model Development on Python Jupyter notebook with Google Cloud Platform BigQuery Machine Learning and AutoML Tables (beta).

AI Platform Notebooks Big Data Data Science GPU Jan. 18, 2021

An Accelerated Big Data Workflow for the Data Analyst - Explore and analyze 1B loan records with RAPIDS & Nvidia A100 GPUs on Cloud AI Platform.

BigQuery Data Science Python Jan. 4, 2021

A gentle introduction to the 5 Google Cloud BigQuery APIs - An overview of BigQuery APIs / client libraries.

BigQuery Data Science Machine Learning Jan. 4, 2021

K-Means Clustering in Google BigQuery ML - A complete guide on the most popular and practical clustering technique natively in Google BigQuery (database+ML).

Cloud Dataproc Data Science Machine Learning Jan. 4, 2021

All you need to know about Google Cloud Dataproc? - Managed Hadoop & Spark #GCPSketchnote.

AI Platform Notebooks Beginner Data Science Jupyter Notebook Dec. 28, 2020

How To Use Google AI Platform Notebooks For Your Data Science Team - Getting started with Google AI Platform Notebooks.

Apache Beam BigQuery Cloud Dataflow Data Science Dataflow Jupyter Notebook Machine Learning Python Dec. 21, 2020

Getting started with Machine Learning on GCP — Part 2: Making data clean and usable - Creating Beam/Dataflow pipeline in Jupyter Notebook.

Data Science Machine Learning Python TensorFlow Dec. 21, 2020

A machine learning pipeline with TensorFlow Estimators and Google Cloud Platform - TensorFlow on GCP — a way to industrialise complex machine learning pipelines.

BigQuery Data Science Data Studio Dec. 21, 2020

Create a real time Dashboard on covid-19 in France with GCP - Using public API to create Covid dashboard in Data Studio.

BigQuery Data Science Dec. 14, 2020

Time series analytics with BigQuery part 2 - The second in a series of posts on implementing time series analytics in BigQuery, this time defining sliding windows and session windows.

BigQuery Data Science Dec. 14, 2020

5 Bigquery SQL performance tips for modern data scientists - SQL tuning tips and advice to help reduce BigQuery costs. Start 2021 off on the right foot!

AI Platform Data Science Machine Learning Nov. 30, 2020

Google Cloud AI Platform: Human Data labeling-as-a-Service Part 2 - Exploring Google’s (human) Data Labelling Service for Advanced Video Labelling.

AI Platform Cloud AutoML Data Science Kaggle Nov. 30, 2020

Kaggle: Man vs Machine - Using AI Platform to identify healthy plants in Kaggle competition.

Billing Data Science Nov. 30, 2020

Isolating trends in public cloud costs using time-series analysis - AWS or Google Cloud costs can be often somewhat confusing and it’s hard to “cut through the noise” to see what really matters.

AI AI Platform Data Science Nov. 22, 2020

Google Cloud AI Platform Unified - Launched on 16 Nov 2020, AI Platform Unified caught us by surprise. Learn exactly what’s been “unified”.

Data Science GCP Certification Nov. 22, 2020

How to pass Google Cloud Platform — Professional Data engineer exams - Preparing for Professional Data Engineer exam.

Cloud Healthcare API Data Science Machine Learning Nov. 22, 2020

Google Cloud Healthcare API - Learn how this can accelerate AI solutions to benefit modern medicine.

BigQuery Data Science Nov. 16, 2020

Time series analytics with BigQuery - Techniques for tumbles, fills, and interpolation.

App Engine Cloud Run Data Science Firebase Python Nov. 16, 2020

Deploying a Python Dash app on App Engine with a Flask/Cloud Run backend and Firebase auth - Learn how to deploy a beautiful dashboard using Python and Dash on GCP. Then add user authentication with Firebase.

Data Science Machine Learning TPU Nov. 16, 2020

Running BERT on Google Cloud Platform With TPU - Use Google Cloud and TPU to Build a Deep Learning Environment.

AI Data Science Machine Learning Nov. 9, 2020

Google Cloud AI Platform: Hyper-Accessible AI & Machine Learning - In this first article of the series, we present an over of Google AI Platform, exploring the services available to modern data science.

Data Science Security Nov. 2, 2020

Understanding Data Encryption in Google Cloud - GCP Comics #4: Encryption to secure your data in cloud.

Apache Beam Cloud Dataflow Data Science Oct. 26, 2020

Dataflow and Apache Beam, the Result of a Learning Process Since MapReduce - An overview of Apache Beam and Cloud Dataflow.

BigQuery Data Science Oct. 19, 2020

Explore Public Datasets with Google BigQuery and DataStudio - Exploring and Reporting Massive Datasets Right Inside Your Web-browser — With an example of COVID-19 Dataset.

Data Science GCP Certification Oct. 19, 2020

How To Pass Google Cloud Professional Data Engineer Exam without IT background. - Passing Data Engineer certification exam with non-IT background.

AI Platform Prediction Data Science Machine Learning TensorFlow Oct. 12, 2020

Lightweight yet scalable TensorFlow workflow on Google Cloud - My superpower toolkit: TFRecorder, TensorFlow Cloud, AI Platform Predictions and Weights & Biases.

Cloud Build Data Science Looker Machine Learning Oct. 5, 2020

Operationalizing BigQuery ML through Cloud Build and Looker - Implementing MLOps with BigQuery ML, Cloud Build and Looker.

AI Platform Cloud SQL Data Science Sept. 28, 2020

Accessing Cloud SQL Data from AI Platform using Python - This article talks about a workaround to access data in Cloud SQL DB from the AI Platform.

BigQuery Data Science GIS Sept. 21, 2020

A beginner’s Guide to Google’s BigQuery GIS - Get started free with Google Big Query GIS with this step by step tutorial.

Big Data BigQuery Data Science Aug. 31, 2020

Google Cloud for Genomics - Building a scalable, reproducible, and secure data processing pipeline on the cloud.

Data Science Aug. 31, 2020

How I passed the Google Professional Data Engineer Exam in 2020 - In 8 days. Quick learner’s guide for those who don’t have time to read the manuals. August 2020.

Data Science Machine Learning Aug. 31, 2020

Managing Your Machine Learning Experiments with MLflow - Deploying MLflow server on GCP.

Data Science Machine Learning Aug. 17, 2020

Scalable Machine Learning with Dask on Google Cloud - A great addition to your arsenal of data science tools, Dask provides you advanced parallelism for computation at scale.

BigQuery Data Science Aug. 10, 2020

Yet another way to generate fake datasets in BigQuery - Wrapping faker.js with a Javascript UDF.

BigQuery Data Science Public Datasets July 27, 2020

Data Science 101 for Startups- Aggregation in SQL — Part 2 - Using aggregation SQL functions on BigQuery public dataset.

Cloud Dataproc Data Science Jupyter Notebook Tutorial July 27, 2020

Getting Started with Jupyter + Spark on the Cloud in 2020 - Spinning up Spark clusters with Jupyter on Cloud Dataproc.

Data Science Machine Learning July 27, 2020

Building a Data Platform to Enable Analytics and AI-Driven Innovation - Build a Data Mesh & Set up MLOps.

BigQuery Data Science Python July 20, 2020

BigQuery + Python for Production Data Science - Accessing BigQuery using Pandas, PySpark, and OS/Python.

BigQuery Data Science Public Datasets July 20, 2020

Data Science 101 for Startups- Aggregation in SQL - Aggregations concepts on examples from BigQuery.

BigQuery Data Science Machine Learning Public Datasets July 13, 2020

Stack Overflow in 2023: Predicting with ARIMA and BigQuery - Predicting the top Stack Overflow tags with ARIMA model in BigQuery.

Data Science Machine Learning Tutorial July 13, 2020

Building Image Detection with Google Cloud AutoML - Building "snack classifier" with AutoML Vision.

BigQuery Data Science July 6, 2020

Get started with BigQuery and dbt, the easy way - Find here the quickest way to get started with dbt and BigQuery using only free offerings from Google Cloud.

AI Platform Data Science July 6, 2020

Using GCP’s AI Platform to Predict Customer Churn - Developing a classification model to address customer churn.

BigQuery Cloud Functions Data Science Python July 6, 2020

Part 2: Building a Simple ETL Pipeline with Python and Google Cloud Functions — MySQL to BigQuery - Extracting data from a MySQL database and loading into Google BigQuery using Google Cloud Functions.

BigQuery Data Science Machine Learning July 6, 2020

Visualizing Pitcher Clusters: A Next OnAir Digital Experience - Analyzing baseball pitchers.

BigQuery Data Science July 3, 2020

How to handle Google Analytics data in BigQuery - The ways & tricks to tackle Shaded Tables and ARRAYs in BigQuery tables.

Data Science Machine Learning Python TensorFlow July 3, 2020

Model with TensorFlow and Serve on Google Cloud Platform - Serving TensorFlow Models on a scalable cloud platform.

BigQuery Data Science June 29, 2020

BigQuery: Creating Nested Data with SQL - Working with SQL on nested data in BigQuery can be very performant. But what if your data comes in flat tables like CSV’s?

BigQuery Data Science June 29, 2020

Easy pivot() in BigQuery, finally - Using dynamic SQL and stored procedures to pivot in BigQuery.

BigQuery Data Science June 29, 2020

Custom cohort size using Range Bucket in SQL. - Using RANGE_BUCKET command in BigQuery.

BigQuery Data Science Public Datasets June 15, 2020

Intro to BigQuery and its Free Data Sets - A quick introduction on how to access and query Google’s BigQuery using their free public datasets.

BigQuery Data Science June 8, 2020

Zero to Differential Privacy in 5 minutes on Google BigQuery - Differential Privacy presents a framework for asking statistical questions about a dataset while provably maintaining the privacy of the entities within that dataset.

BigQuery Data Science June 1, 2020

The Best Way to Generate Indices in BigQuery - Using GENERATE_ARRAY for Histograms and More.

AI Platform Notebooks Big Data Data Science Machine Learning June 1, 2020

Hands-on Big Data Analysis on GCP Using AI Platform Notebooks - Example of working with AI Platform Notebooks.

Cloud Composer Compute Engine Data Science May 18, 2020

Airflow on GCP (May 2020) - This is a complete guide to install Apache Airflow on a Google Cloud Platform Virtual Machine from scratch.

Big Data Data Catalog Data Science May 18, 2020

Google Cloud Data Catalog — Integrate Your On-Prem RDBMS Metadata - Code samples with a practical approach on how to ingest metadata from on-premise Relational Databases into Google Cloud Data Catalog.

Data Science Machine Learning Serverless May 11, 2020

13 Most Common Google Cloud Reference Architectures - Summary of #13DaysOfGCP architecture Twitter series.

BigQuery Data Science April 27, 2020

How to UNPIVOT multiple columns into tidy pairs with SQL and BigQuery - This post is for anyone dealing with time series in CSVs with one new column for each day.

BigQuery Data Science Data Studio Visualization April 27, 2020

Empowering Apple Mobility Trends Reports with BigQuery and Data Studio - Analyzing Apple's mobility data using BigQuery and Data Studio.

BigQuery Cloud Dataproc Data Science Jupyter Notebook March 16, 2020

Apache Spark and Jupyter Notebooks made easy with Dataproc component gateway - Make use of the new Dataproc optional components and component gateway features to easily use Jupyter Notebooks.

BigQuery Data Science Public Datasets March 16, 2020

Data analysis with SQL and BigQuery on New york city bikes data. - Starting with New York biking open data analysis.

Data Science Jupyter Notebook Machine Learning March 16, 2020

Setting Up Jupyter on Google Cloud - A scriptable list of command lines to deploy Jupyter in Google Cloud, securely and cost-effectively, with added exercises.

Beginner Cloud Composer Cloud Dataproc Data Science March 9, 2020

A gentle introduction to Data Workflows with Apache Airflow and Apache Spark - A tutorial on using Cloud Composer (Airflow) to launch Spark jobs on Cloud Dataproc.

AI Platform AI Platform Notebooks Data Science March 9, 2020

Reducing Startup Time For Notebooks With Custom Containers - Have you ever tried to use Cloud AI Platform Notebooks with huge containers?

BigQuery Data Science March 2, 2020

What do party schools and energy efficiency have in common? - Using BigQuery to analyze public data on building energy use.

AI Platform Data Science Docker Machine Learning Python March 2, 2020

Serverless machine learning using Docker - Running containers in Google AI Platform.

Data Science Serverless March 2, 2020

Introducing Serverless Orchestration with Houston - Serverless workflow control on Google Cloud Platform.

BigQuery Data Science Data Studio Feb. 24, 2020

Reddit AmItheAsshole is nicer to women than to men — a SQL proof? - Analyzing Reddit posts with BigQuery and visualizing in Data Studio.

Compute Engine Data Science Feb. 24, 2020

Jupyter Notebook on Google Compute Engine with HTTPS - Setting up Jupyter to run on Google Compute Engine and be accessed via HTTPS.

Data Science Machine Learning Feb. 17, 2020

All things GCP: Machine Learning Decision pyramid - Understand which Google Cloud tools matches best for you.

Apache Beam BigQuery Data Science Jan. 27, 2020

Fastai batch prediction on a BigQuery table - From this article, you will get to know how to perform a batch prediction on a BigQuery table using a fastai model.

BigQuery Data Science Data Studio Jan. 27, 2020

Interactive: The top 2019 Wikipedia pages - Going deeper into Wikipedia most popular pages for 2019 with BigQuery and Data Studio.

BigQuery Data Science Jan. 27, 2020

Inequality: How to draw a Lorenz curve with SQL, BigQuery, and Data Studio - Analyzing the popularity of Wikipedia pages based on public data set.

AI Platform Data Science Machine Learning Python Jan. 20, 2020

Using Scikit-learn on Google Cloud Platform - Training Scikit-learn models on GCP’s AI Platform.

Data Science Dec. 16, 2019

This is how you put the data in Data Science! - Google's search engine for Datasets.

BigQuery Data Science Dec. 9, 2019

Advent of code: SQL + BigQuery - Solving the Advent of Code challenges with SQL and BigQuery.

Data Science Machine Learning Dec. 2, 2019

Get started or improve your Machine Learning of structured data using AutoML Tables! (Part 1) - Challenges we are trying to solve and part 2 will go…

AI Platform Data Science Machine Learning Python Nov. 25, 2019

Predicting Taxi fares in NYC using Google Cloud AI Platform (Billion + rows) Part 3 - The objective of this series of articles is to create a Machine Learning model that is able to estimate taxi fares in NYC before the ride commences.

Big Data BigQuery Data Science GCP Experience Nov. 18, 2019

Batch Processing Pipelines for Better Data Analysis - An overview of how Gojek is using batch processing to generate useful insights from our data warehouse.

Big Data BigQuery Data Science Nov. 18, 2019

BigQuery workflow from the Jupyter notebook - In this article, you will get to know how to create and schedule the BigQuery workflow using the Jupyter Lab and the Cloud Composer.

App Engine BigQuery Data Science Python Nov. 18, 2019

Python / Pandas & BigQuery in 7 minutes - Using BigQuery in Django app.

Data Catalog Data Science Nov. 18, 2019

Boosting the Data Governance journey with Google Cloud Data Catalog - Thoughts on data discovery and metadata management in Google Cloud.

Data Science Kubernetes Machine Learning Nov. 18, 2019

MiniKF is now available on the GCP Marketplace - MiniKF is the fastest and easiest way to get started with Kubeflow. With just a few clicks, you are ready for experimentation, and for running complete Kubeflow Pipelines.

BigQuery Data Science Nov. 11, 2019

Anomaly Detection With SQL - Demonstrating SQL anomaly detection on a public dataset in BigQuery.

BigQuery Data Science Machine Learning Nov. 11, 2019

ML Design Pattern #5: Repeatable sampling - Use a well-distributed column to split your data into a train/valid/test.

BigQuery Data Science Data Studio Nov. 4, 2019

Analyzing the crisis with reddit and BigQuery: 2019 Chilean protests - Analyzing and visualizing data from Reddit with BigQuery and Data Studio.

Big Data BigQuery Data Science Nov. 4, 2019

Let the kids into the library - An opinionated attempt at building a data driven company in the cloud.

Beginner Data Science Machine Learning Nov. 4, 2019

Using a cluster in the cloud for Data Science projects in 4 simple steps - Tutorial on how to set up Jupyter notebook on GCP.

Big Data BigQuery Data Science Python Oct. 28, 2019

How to get into BigQuery analysis on Kaggle with Python? - Exploring ways to use BigQuery in Kaggle.

Big Data Data Science Oct. 28, 2019

A gentle introduction to Apache Druid in Google Cloud Platform - The article describes how to set up and use Apache Druid on GCP.

Data Science Machine Learning TensorFlow Oct. 28, 2019

Predicting Taxi fares in NYC using Google Cloud AI Platform (Billion + rows) Part 2 - Using data from BigQuery to create a Tensorflow model of predicting taxi fares in NYC.

BigQuery Data Science Sept. 30, 2019

10 top tips: Unleash your BigQuery superpowers - If BigQuery was superhero, what kind of superpowers would it have?

BigQuery Data Science Sept. 16, 2019

Loading MySQL backup files into BigQuery — straight from Cloud SQL - Loading Cloud SQL MySQL backup data into BigQuery.

BigQuery Data Science Python Sept. 2, 2019

Slow BigQuery results no more - How the use of BigQuery Storage API improves the speed of results retrieving from BigQuery.

BigQuery Data Science Data Studio Aug. 19, 2019

Don’t Double Park in Brooklyn - Analyzing New York's open data about state vehicle registration using BigQuery and Data Studio.

AI Data Science Machine Learning Aug. 19, 2019

How to Upgrade Colab with More Compute - Learn how to use Google Cloud Platform’s Deep Learning VMs to power up your Colab environment, on this episode of AI Adventures

Data Science Aug. 12, 2019

4 Data Studio tricks - UX and UI tips for Data Studio.

BigQuery Data Science July 29, 2019

BigQuery: SQL on Nested Data - Examples of working with nested data in BigQuery.

BigQuery Data Science Machine Learning July 29, 2019

Clustering 4,000 Stack Overflow tags with BigQuery k-means - Using BigQuery ML to cluster tags from StackOverflow.

Big Data BigQuery Data Science Java July 15, 2019

Beast: Moving Data from Kafka to BigQuery - GOJEK’s open source solution for moving data from Kafka to Google BigQuery.

Data Science DevOps Kubernetes Machine Learning July 15, 2019

Automated Model Retraining with Kubeflow Pipelines - How to implement a reproducible ML workflow that adapts to new data

BigQuery Data Science July 8, 2019

New in BigQuery: Persistent UDFs - Using new functionality of saving User Defined Functions in BigQuery.

BigQuery Data Science Python July 8, 2019

BigQuery and Public Datasets. Overview for Data Analysts - In this article we’ll briefly explore what is BigQuery and how a data analyst can access and use it through various interfaces with…

BigQuery Data Science July 8, 2019

An open source Python package for moving HelpScout data into Google BigQuery - This article is written for business analysts, data scientists and engineers that need to integrate Help Scout data into their Google BigQuery pipeline, and have hands-on experience dealing with Python, APIs and SQL databases.

Big Data Data Analytics Data Catalog Data Science July 8, 2019

Google Cloud Data Catalog hands-on guide: templates & tags with Python - This quickstart guide brings a practitioner approach to Data Catalog, covering Templates & Tags management using the Python client library.

BigQuery Data Science Data Studio June 24, 2019

From College to the Pros with Google Cloud Platform (Part 1) - Getting together and analyzing NBA players stats.

Data Science GCP Certification June 24, 2019

10 Days to Become a Google Cloud Certified Professional Data Engineer - Overview of resources used for Data Engineer exam preparation.

Data Science R June 24, 2019

From College to the Pros with Google Cloud Platform (Part 2) - The second part of NBA players analysis.

Cloud Dataproc Data Science June 17, 2019

Scale out RAPIDS on Google Cloud Dataproc - Scaling GPU data jobs on Cloud Dataproc.

BigQuery Data Science GCP Experience June 17, 2019

Analytics at lightspeed with Google BigQuery - The article describes how Aditya Birla Group created a digital platform on GCP to manage the travel of their employees.

Data Science June 17, 2019

Setup Julia with Jupyter notebook on Google Cloud Platform - Tutorial on how to set up and use Julia on Jupyter notebooks hosted on GCP.

Data Science Security June 10, 2019

How to use cloud storage to securely load data into Neo4j - Methods for loading data into a remote Neo4j Instance — Part 2

Apache Beam Cloud Dataflow Data Science Python May 13, 2019

Let’s Build a Streaming Data Pipeline - Creating Apache Beam / DataFlow pipeline to parse web server logs.

Data Science GCP Certification May 13, 2019

Passing the (new) Google Professional Data Engineer exam within 7 weeks - Experience of preparing and taking Data Engineer certification.

Data Science April 15, 2019

How to get started with Google Colab and Kaggle - Example of using Colab for Kaggle competitions.

AI Data Science Machine Learning April 8, 2019

GCP Notebook Executor v0.1.2 - Executing long running Jupyter Notebook jobs on GCP.

BigQuery Cloud Dataflow Cloud Dataprep Data Science Machine Learning TensorFlow April 8, 2019

End-to-end churn prediction on Google Cloud Platform - Overview of GCP architecture to build customer churn prediction compromising of data acquisition, data wrangling, modeling, model deployment, and a business use case.

 

Latest Issues




Contact

Zdenko Hrček
Třebanická 183
Prague, Czech Republic
Phone: +420 777 283 075
Email: [email protected]