Data analysis

Graphviz by Example: Part Two

My previous post introduced the dot language and showed how it can be used to create flowcharts. For part two, I set out to partially reproduce a more demanding visualization to highlight what Graphviz can do. The original graphic was taken from the website for the…

Graphviz by Example: Part One

Introduction: Graphviz is an open-source software package developed by AT&T Labs for generating directed graphs and flowcharts. Outputs are created using Dot, a plain text graph description language that is part of the Graphviz package. Graphviz is a powerful application that allows users to create…

R 101: Summarizing Data

When working with large amounts of tabular data, a common operation is to summarize that data in different ways using specific variables. In Microsoft Excel, pivot tables are a nice feature for this purpose. While not…

Marketing Strategies for New Business

What can Walmart’s social media marketing strategy teach other retailers and merchants? From everyday mom-and-pop retailers to well-known retail giants like Walmart and Target, everyone is taking advantage of social media marketing. Whether their favorite outlet is Facebook, Twitter, Pinterest, or Google+, everybody wants…

Advanced Clustering Tools in R

Introduction: R offers several useful clustering tools, but one is unusually useful in its combination of method and output display: the integration of hierarchical clustering with a visual display of its results in a heatmap. The term “heatmap”…

Using csvkit to Summarize Data: A Quick Example

As data analysts, we’re frequently presented with comma-separated value files and tasked with reporting insights. While it’s tempting to import that data directly into R or Python in order to perform data munging and exploratory data analysis, there are also a number of utilities to…

Data Analysis from External Service Providers

Why should retailers explore transferring their advanced data analytics requirements to an external service provider? A vast wealth of knowledge is floating around the World Wide Web and collecting in the digital storage vaults of retail companies across the world. Making sense…

Turning Data Into Awesome With sqldf and pandasql

Both R and Python have libraries for using SQL statements to interact with data frames. While both languages have native facilities for manipulating data, the sqldf and pandasql libraries provide a simple and elegant interface for conducting tasks using an intuitive framework that’s widely used by analysts. R…

Strategies to Enhance Social Media Campaigns

Five strategies for retailers to add amp to their social media campaigns. Social media has changed the way businesses market themselves. Sites like Facebook, Twitter, YouTube, Foursquare and Pinterest are used by scores of people every single day. A good social media marketing…

Data Mining is not Screen Scraping

Data Mining and Screen Scraping do not mean the same thing in computing. Some people may disagree with this remark, but they are two separate concepts and disciplines. To give a brief answer, one can explain the concepts this way: Screen Scraping…
