I was lucky enough to attend the maiden presentation of this at Microsoft Reading yesterday. It was pretty gripping stuff not only because of what was said but also because of what could only be hinted at. Here’s what I took away from the day. (Disclaimer: I’m not a BI guru, just a reasonably experienced BI developer, so I may have misunderstood or misinterpreted a few things. Particularly when so much of the talk was about the vision and subtle hints of what is coming. Please comment if you think I’ve got anything wrong. I’m also not going to even try to cover Master Data Services as I struggled to imagine how you would actually use it.)
I was a bit worried when I learned that the whole day was going to be presented by one guy but Rafal Lukawiecki is a very engaging speaker. He’s going to be presenting this about 20 times around the world over the coming months. If you get a chance to hear him speak, I say go for it. No doubt some of the hints will become clearer as Denali gets closer to RTM.
Firstly, things are definitely happening in the SQL Server Reporting and BI world. Traditionally IT would build a data warehouse, then cubes on top of that, and then publish them in a structured and controlled way. But, just as with many IT projects in general, by the time it’s finished the business has moved on and the system no longer meets their requirements. This not sustainable and something more agile is needed but there has to be some control. Apparently we’re going to be hearing the catchphrase ‘Balancing agility with control’ a lot.
More users want more access to more data. Can they define what they want? Of course not, but they’ll recognise it when they see it. It’s estimated that only 28% of potential BI users have meaningful access to the data they need, so there is a real pent-up demand. The answer looks like: give them some self-service tools so they can experiment and see what works, and then IT can help to support the results. It’s estimated that 32% of Excel users are comfortable with its analysis tools such as pivot tables. It’s the power user’s preferred tool. Why fight it? That’s why PowerPivot is an Excel add-in and that’s why they released a Data Mining add-in for it as well.
It does appear that the strategy is going to be to use Reporting Services (in SharePoint mode), PowerPivot, and possibly something new (smiles and hints but no details) to create reports and explore data. Everything will be published and managed in SharePoint which gives users the ability to mash-up, share and socialise what they’ve found out. SharePoint also gives IT tools to understand what people are looking at and where to concentrate effort. If PowerPivot report X becomes widely used, it’s time to check that it shows what they think it does and perhaps get it a bit more under central control. There was more SharePoint detail that went slightly over my head regarding where Excel Services and Excel Web Application fit in, the differences between them, and the suggestion that it is likely they will one day become one (but not in the immediate future).
That basic pattern is set to be expanded upon by further exploiting Vertipaq (the columnar indexing engine that enables PowerPivot to store and process a lot of data fast and in a small memory footprint) to provide scalability ‘from the desktop to the data centre’, and some yet to be detailed advances in ‘frictionless deployment’ (part of which is about making the difference between local and the cloud pretty much irrelevant).
Excel looks like becoming Microsoft’s primary BI client. It already has:
- the ability to consume cubes
- strong visualisation tools
- slicers (which are part of Excel not PowerPivot)
- a data mining add-in
A major hurdle for self-service BI is presenting the data in a consumable format. You can’t just give users PowerPivot and a server with a copy of the OLTP database(s). Building cubes is labour intensive and doesn’t always give the user what they need. This is where the BI Semantic Model (BISM) comes in. I gather it’s a layer of metadata you define that can combine multiple data sources (and types of data source) into a clear ‘interface’ that users can work with. It comes with a new query language called DAX. SSAS cubes are unlikely to go away overnight because, with their pre-calculated results, they are still the most efficient way to work with really big data sets.
A few other random titbits that came up:
- Reporting Services is going to get some good new stuff in Denali.
- Keep an eye on www.projectbotticelli.com for the slides. You can also view last year’s seminar sessions which covered a lot of the same ground as far as the overall strategy is concerned. They plan to add more material as Denali’s features are publicly exposed.
- Check out the PASS keynote address for a showing of Yahoo’s SQL BI servers. Apparently they wheeled the rack out on stage still plugged in and running!
- Check out the Excel 2010 Data Mining Add-Ins. 32 bit only at present but 64 bit is on the way.
- There are lots of data sets, many of them free, at the Windows Azure Marketplace Data Market (where you can also get ESRI shape files).
- If you haven’t already seen it, have a look at the Silverlight Pivot Viewer (http://weblogs.asp.net/scottgu/archive/2010/06/29/silverlight-pivotviewer-now-available.aspx).
- The Bing Maps Data Connector is worth a look if you’re into spatial stuff (http://www.bing.com/community/site_blogs/b/maps/archive/2010/07/13/data-connector-sql-server-2008-spatial-amp-bing-maps.aspx).