Ace the Data Science Interview by Refreshing Top Hadoop Questions and Answers
In this article, we discuss key concepts of hadoop and answer various common questions related to hadoop.
Covering Data Science, Business Intelligence, Technology Industry News Updates and Trends
In this article, we discuss key concepts of hadoop and answer various common questions related to hadoop.
This tutorial provides, hands on coding examples covering database programming for Greenplum/PostgreSQL using SQLAlChemy library.
This tutorial shares basic database programming code for PostgreSQL and Greenplum using Python and psycopg2 library.
Not many data scientists realize that it is possible and much more beneficial to build data science models directly on top of a data warehouse. Apache MADlib is an open-source…
Greenplum getting started tutorial, covering basic commands to start/stop the database, creating schema, and tables.
Solving data science problems requires systematic thinking and approach, here's some of the key concepts and steps that you need to apply when trying to solve data science problems.
Understanding big data architecture, and developing effective strategies for big data is key to building modern data science solutions. Here, we provide high level overview of big data architecture.
Q1. What is the minimum no. of variables/ features required to perform clustering? A. 0 B. 1 C. 2 D. 3 Solution: (B) At least a single variable is required…
Python multiple choice questions for learning and self improvement.
Common data science questions and answers for refreshing your data science knowledge with the help of most common data science interview questions.
Building great data science models is key to success in business. Understanding the difference between data science and business intelligence is key to using these to succeed in your business…
Spark is a fast, easy-to-use and flexible data processing framework. It has an advanced execution engine supporting cyclic data flow and in-memory computing. Spark can run on Hadoop, standalone or…
What is Sentiment Analysis ? Sentiment analysis and opinion mining technology is an important tool for businesses, and governments to judge and evaluate the reaction of people for their services.…
Deep learning is a field with intense computational requirements, and your choice of GPU will fundamentally determine your deep learning experience. As discussed previously a CPU is divided into multiple…
Greenplum Architecture The main reason behind adaptation of massively parallel processing(MPP), data warehouse(DWH) solution is MPP architectural principles. These principles aim at removing main drawbacks of traditional DWH, and make…
Exploratory data analysis is an important step in any data science and even business intelligence project, which gives you adequate understanding of underlying data.
Database programming in Python using SQLAlchemy library that provides uniform interface for connecting with different databases.
This article provides a good overview of exploratory data analysis techniques, and then provides code examples using seaborn library.
Understanding Greenplum architecture and use cases for various big data and data science scenarios for business and industry.
Understanding payment fraud is key to successfully building a payment fraud detection system. In this article, we provide a comprehensive overview of payment fraud analysis.