Understanding and Developing Data Strategy and Monetization
A data strategy is a plan that outlines how an organization will collect, store, manage, and use its data. It is important because it can help organizations to improve decision-making,…
Covering Data Science, Business Intelligence, Technology Industry News Updates and Trends
This tutorial covers how to install PostGIS on Ubuntu 18.04 Linux for a Greenplum 6 clustered setup. PostGIS is an open-source PostgreSQL database extension for creating geographic objects on…
To be successful as a data analyst, you need to learn fundamental data analysis techniques and data-oriented programming languages, and have a strong background in math. Here are the most…
Are you looking for a tool to visualize, explore, and analyze data that can be modified and adapted to your organizational architecture without costing a fortune? Apache Superset is…
In this code example, we will apply a convolutional neural network (CNN) to textual data. More specifically, we will use the structure of CNNs to classify text. Unlike images, which are…
In this article, we discuss key concepts of Hadoop and answer various common questions related to Hadoop.
This tutorial provides hands-on coding examples covering database programming for Greenplum/PostgreSQL using the SQLAlchemy library.
This tutorial shares basic database programming code for PostgreSQL and Greenplum using Python and the psycopg2 library.
Not many data scientists realize that it is possible and much more beneficial to build data science models directly on top of a data warehouse. Apache MADlib is an open-source…
Python multiple-choice questions for learning and self-improvement.
Common data science interview questions and answers for refreshing your data science knowledge.
Building great data science models is key to success in business. Understanding the difference between data science and business intelligence is key to using these to succeed in your business…
Greenplum Architecture: The main reason behind the adoption of massively parallel processing (MPP) data warehouse (DWH) solutions is the MPP architectural principles. These principles aim at removing the main drawbacks of traditional DWHs, and make…
Exploratory data analysis is an important step in any data science or business intelligence project, giving you an adequate understanding of the underlying data.
Database programming in Python using the SQLAlchemy library, which provides a uniform interface for connecting to different databases.
This article provides an overview of exploratory data analysis techniques, followed by code examples using the seaborn library.
Data scientists often turn to famous deep neural networks such as AlexNet and LeNet for inspiration when designing neural networks for real-world problems.
One of the key reasons we often use deep learning is that we need to extract features that would effectively solve a machine learning problem.
Matplotlib is a cross-platform data visualization and graphical plotting library for Python and its numerical extension NumPy. As such, it offers a viable open-source alternative to MATLAB. Developers can…
NumPy – A Replacement for MATLAB: NumPy is often used along with packages like SciPy (Scientific Python) and Matplotlib (plotting library). This combination is widely used as a replacement for…