I was somewhat excited and was looking forward to reading this book, given that it firstly is on my area of work at Oracle - enterprise analytics, secondly because it covers an important, growing, and still nebulous area of both the technology and business of Big Data, and thirdly, because the author is a very well-respected authority on the field and who shot to wider fame with his now famous 2007 article in the Harvard Business Review, "Competing on Analytics". With this preamble, I submit to you, my review of the book, "Enterprise Analytics: Optimize Performance, Process, and Decisions Through Big Data (FT Press Operations Management)", Thomas H. Davenport, et al.
(Amazon, My Amazon review, Kindle US, Amazon UK, Kindle UK, Flipkart)
Or, how a book on Big Data, Enterprise Analytics, and technology can neatly skirt any meaningful discussion of Big Data, Enterprise Analytics, and technology.
While a few chapters stand out for their reasoning and clarity, what is jarringly absent from this book is any meaningful, technical discussion about Big Data itself. Without such a discussion, most of the book's content can be recycled with minimum effort ten years from now and applied to the next big thing in technology. Even assuming that this book is targeted at decision makers and so-called C-level executives, an absence of the nuances and complexities of Big Data mean that executives will be as clueless on that dimension of Big Data knowledge after reading the book as before. If you are responsible for selling sausages, you had jolly well get a look at the sausage factory, if not work there a day.
Big Data, Unstructured Data, the Cloud - if these three buzzwords were not enough, you can add the salsa-ish phrase SoLoMo - i.e. Social, Local, and Mobile, to the mix. Businesses, consultants, enterprises, anyone who is anything in technology wants to know more about what this buzzword alphabet soup is, and how to make sense of it before their competitor does, or worse - a disruptor.
The hope is that if the decision makers, the corner-room occupiers can understand this, they will be better able to drive a coherent process and structure within their organizations to take advantage and benefit. Hence this book.
The book is a collection of eighteen chapters, divided into five parts - the first is an "Overview", "Application", "Technologies", "Human Side", and "Case Studies" of Analytics. Each chapter is written by a different author, with a total of fourteen authors in the fray. A few chapters have been written by the editor himself, Thomas Davenport, and these are among the standout chapters - for their clarity and organization.
Since I work in the technology of analytics, I should be excused for either taking too technical a view of things, or for being too harsh in my criticisms. Having said that, there are at least quibbles that in my opinion leave this book only a middling, mediocre effort, and not one that will be remembered or consulted, much, if at all.
- The close and financial collaboration of at least a few technology vendors with the International Institute of Analytics means that most of the specific examples cited in this book are where the technology vendor's solution was used. Fair enough, but it leaves an aftertaste of an advertorial in the reader's mind.
- The chapter on "NBOs", i.e. "Next Best Offers" takes several cheap shots at Amazon (see page 90), which left me wondering whether Amazon had not turned down six-figure consulting offers from either of the authors to warrant this broadside.
- There is a laboured chapter on "engagement" - an attempt to define a compound measure based on essentially a summation of basically arbitrary weighted base measures. For instance, a measure of online engagement is a summation of eight different indices. Putting a "sigma" symbol in the equation makes it look impressive, but in the end it is more arbitrary than methodical. Because decision makers need to have information supplied to them in a simple manner, it is often supplied to them in a simplistic manner, cloaked in technical-sounding phrases.
- Privacy is becoming an increasingly sensitive and relevant issue as more data is collected from customers and users, often without their knowledge, and sometimes without their consent, almost always without providing users a clear picture of what is done with that user-data so collected. Privacy is an important topic in this discussion on enterprise analytics. And it is given short shrift in the book. After a cursory nod, almost as an afterthought, to privacy concerns, there are examples cited that are almost creepy in the extent they suggest the invasion of a user's privacy. Sample these: "The next generation of video game offers could have pictures of your friends or your tastes and interests built right in." Or "A company called Sense Networks has developed an application to help infer a person's lifestyle based on his or her location history." Harvesting a user's location and web-click information should require explicit opt-in - it is basic respect for human decency. Take the section where the authors talk about collecting data by anonymizing data. It has been proven that even after anonymizing data, it is possible to individually identify users with a very high degree of accuracy based on only a few attributes. There is no such thing as truly anonymous user-tracking on the web. Only if you use "unsophisticated marketing techniques" do you risk customer ire over harvesting their "spending habits", which they consider "inviolate". Get it? Be sophisticated, and you can get away with pillaging your customers' privacy. Is this really the new normal - customer privacy shmrivacy?
- Then there is this most curious statement that states - "Despite all the hype around the unstructured data component of "big data", it seems that structured data still rules the in predictive analytics." Well, yes! Unstructured data is fairly recent, especially when compared with structured data, that has been around for literally decades. It is but natural that the use of unstructured data in predictive analytics will take time to gain traction, especially as the technology and means of blending structured and unstructured data evolve.
- Even the chapter, "Predictive Analytics in the Cloud" contains phrases that make absolutely no sense, other than to bump up the chapter's jargon-index. Sample this: "These cloud-based solutions inject predictive analytics into other software that is cloud-based or delivered as SaaS." Is a specific example too much to ask? "inject" is an impressive-sounding word, especially if you have heard of the phrase "sql-injection" in the context of hacking attacks, but just what does the word "inject" mean in the context of predictive analytics and the cloud? And why does this injection require the cloud? Can it not be done with more traditional, hosted solutions? And what exactly are "cloud-based dashboards"??? Is any dashboard served via a browser "cloud-based"?
I could go on an on, but a short summary of the book would be this: each chapter suggests and promises value, but falls short.
In a good mood, as always.
Jan 15, 2013
[Originally posted on my personal blog, at blog.abhinavagarwal.net]