Category Archives: Data Analytics & Visualization

Simple Polar Clock in Javascript

Ever since discovering the Chronotebook, I’ve been toying with ideas about the best way to represent cyclical time.

I found a few good examples to get me started and I created this minimalist design:


There are several ideas and features I’d like to implement in the future. Stay tuned!

Creative Commons License
Polar Clock by Fred Eaker is licensed under a Creative Commons Attribution 3.0 Unported License.

Text Extraction and File Type Detection

Text Extraction Comparison - Tomaž Kovačič
Back in December 2006, I ran a series of tests on Java-based file type detection. At the time, I was researching digital asset management systems and in particular, the possibility of open source full-text search and semantic analytics.

Fast-forward to June 2011 and ReadWriteWeb’s Head to Head Comparison of Text Extraction Algorithms. It is amazing to look back at my old research and see what tools are available now that would have been part of the evaluation process. I also like Tomaž Kovačič’s thorough explanation of his testing methods and results.

Business Intelligence and Data Science

Steve Miller makes an interesting series of comparisons between Business Intelligence and Data Science, extensively referencing Mike Loukides’ “What is data science?”, culminating in this table:

Business Intelligence Data Science
Content/Tools Decision Support System lineage Statistical Science lineage
Relational Database-centric Cloud-centric, Massively Parallel, alernate data stores such as Cassandra, Hadoop
Database Warehouse Data Platform
Focus on reporting and dashboards Focus on statistics and experiements
Online analytical processing (OLAP) Machine Learning
Extract, transform, load (ETL) Data munging/conditioning
Visualization Visualization and creative design
Mostly propreitary, some open source Mostly open source, some propreitary
Business IT-owned Analytics-owned
Technology and business Mathematics and science
Performance management Data products
Methodical Inspirational
Middle-aged Adolescent
Division of labor Jack of all trades
Teams One-offs
Short- to medium-sized projects Quicker hits
Prescision Speed
More governance Less governance
Data Complete data Missing data
Quality-centric Quantity-centric
Absolute Approximate
More interal data More external data
Structured data Structured and unstructured data
Small, medium and large data Big data