Understanding and Developing Data Strategy and Monetization
A data strategy is a plan that outlines how an organization will collect, store, manage, and use its data. It is important because it can help organizations to improve decision-making,…
Covering Data Science, Business Intelligence, Technology Industry News Updates and Trends
A data strategy is a plan that outlines how an organization will collect, store, manage, and use its data. It is important because it can help organizations to improve decision-making,…
Fear of AI is a type of technophobia, which is a fear of computers, AI, robots, and similar technologies. AI has been the subject of much discussion and debate in…
Introduction Artificial General Intelligence (AGI) refers to a hypothetical form of artificial intelligence that would be capable of performing any intellectual task that a human can do. Unlike current AI…
An LLM is a type of neural network based on the Transformer architecture, which was first introduced in a 2017 paper by Google engineers titled “Attention is All You Need.”…
Data Lakes are the foundation for building AI driven solutions that many organizations need. There is much more information in audios, videos, images and social media chatter that can drive…
This tutorial will cover how to install PostGIS on Ubuntu 18.04 Linux for Greenplum 6 clustered setup. PostGIS is an open source PostgreSQL database extension for creating Geographic objects on…
Windows functions are an important tool for data analysis. In this tutorial, we go through windows functions in SQL and provide examples for their use in Greenplum.
Introduction Greenplum follows a master/slave architecture, where data is distributed on segment nodes. Master node is responsible for executing user queries on segment nodes, and getting the results back to…
Quick reference for essential Git commands.
Introduction To be successful as data analysts you need to learn fundamental data analysis techniques, and data-oriented programming languages, and have a strong background in math. Here are the most…
Are looking for a tool to visualize, explore and analyze data that can be modified and adapted to organizational architecture while it does not cost a fortune? Apache Superset is…
In this article, we discuss key concepts of hadoop and answer various common questions related to hadoop.
This tutorial provides, hands on coding examples covering database programming for Greenplum/PostgreSQL using SQLAlChemy library.
This tutorial shares basic database programming code for PostgreSQL and Greenplum using Python and psycopg2 library.
Not many data scientists realize that it is possible and much more beneficial to build data science models directly on top of a data warehouse. Apache MADlib is an open-source…
Greenplum getting started tutorial, covering basic commands to start/stop the database, creating schema, and tables.
Solving data science problems requires systematic thinking and approach, here's some of the key concepts and steps that you need to apply when trying to solve data science problems.
Understanding big data architecture, and developing effective strategies for big data is key to building modern data science solutions. Here, we provide high level overview of big data architecture.
Q1. What is the minimum no. of variables/ features required to perform clustering? A. 0 B. 1 C. 2 D. 3 Solution: (B) At least a single variable is required…
Common data science questions and answers for refreshing your data science knowledge with the help of most common data science interview questions.