Posts

Transmissions from TechEd USA 2009 (Day 3)

Today was a long day. I started by attending the Richard Tkachuk’s A First Look at Large-Scale Data Warehousing in Microsoft SQL Server Code Name “Madison”. Those of you familiar with Analysis Services would probably recognize the presenter’s name since Richard came from the Analysis Services team and maintains the www.sqlservernanalysisservices.com website. He recently moved to the Madison team. Madison is a new product and it’s based on a product by DATAllegro which Microsoft acquired sometime ago. As the session name suggests, it’s all about large scale databases, such as those exceeding 1 terabyte of data. Now, this is enormous amount of data that not many companies will ever amass. I’ve been fortunate (or unfortunate) that I never had to deal with such data volumes. If you do, then you may want to take a look at Madison. It’s designed to maximize sequential querying of data by employing a shared-nothing architecture where each processor core is given dedicated resources, such as a table partition. A controller node orchestrates the query execution. For example, if a query spans several tables, the controller node parses the query to understand where the data is located. Then, it forwards the query to each computing node that handles the required resources. The computing nodes are clustered in a SQL Server 2008 fail-over cluster which runs on Windows Server 2008. The tool provides a management dashboard where the administrator can see the utilization of each computing node.

Next, I attended the Fifth Annual Power Hour session. As its name suggests, TechEd has been carrying out this session for the past five years. The session format was originally introduced by Bill Baker who’s not longer with Microsoft. If you ever attended one of these sessions, you know the format. Product managers from all BI teams (Ofice, SharePoint, PerformancePoint, and SQL Server) show bizarre demos and throw t-shirt and toys to everything that moves (OK, sits). The Office team showed an Excel Services demo where an Excel spreadsheet ranked popular comics characters. Not to be outdone, the PerformancePoint team showed a pixel-based image on Mona Lisa. Not sure what PerformancePoint capabilities this demonstrated since I don’t know PerformancePoint that well but it looked great.

The Reporting Services team showed a cool demo where the WinForms ReportViewer control would render a clickable map (the map control will debut in SQL Server 2008 R2) that re-assigns the number of Microsoft sales employees around the US states. For me, the coolest part of this demo was that there was no visible refresh when the map image is clicked although there was probably round tripping between the control and the server. Thierry D’Hers later on clued me in that there is some kind of buffering going on which I have to learn more about. This map control looks cool! Once I get my hands on it with some tweaking maybe I’ll be able to configure it as a heat map that is not geospatial.

Finally, Donald Farmer showed another Gemini demo which helped learn more about Gemini. I realized that 20 mil+ rows were compressed to 200 MB Excel file. However, the level of compression really depends on the data loaded in Excel. Specifically, it depends on the redundant values in each column. I learned that the in-memory model that is constructed in Excel is implemented as in-process DLL whose code was derived from the Analysis Services code base. The speed of the in-memory model is phenomenal! 20 mil rows sorted within a second on the Donald’s notebook (not even laptop, mind you). At this point Microsoft hasn’t decided yet how Gemini will be licensed and priced.

As usual, after lunch I decided to hang around in the BI learning center and help with questions. Then, it was a show time for my presentation! I don’t why but every TechEd I get one of these rooms that I feel intimidated just to look at them. How come Microsoft presenters who demo cooler stuff than mine, such as features in the next version, get smaller rooms and I get those monstrous rooms? It must be intentional; I have to ask the TechEd organizers. The room I got was next to the keynote hall and could easily accommodate 500-600 people, if not more. Two years ago, I actually had a record of 500+ people attending my session which was scheduled right after the keynote.

This year, the attendance was more modest. I don’t have the final count yet, but I think about 150+ folks attended my session so there was plenty of room to scale up. I think the presentation well very well. The preliminary evaluation reports confirm this. I demoed report authoring, management, and delivery tips sprinkled with real-life examples. We had some good time and I think everyone enjoyed the show.

It’s always good to know that your work is done. I look forward to enjoying the rest of TechED and LA.

Transmissions from TechEd USA 2009 (Day 2)

I started the day by attending the Donald Farmer and Daniel Yu’s session “Creating the Right Cubes for Microsoft Excel and Excel Services” hoping that I’ll get a sneak preview of Excel 2010. Alas, it was all about refining the cube definition with display folders, perspectives, hierarchies, etc. so it appears more user-friendly in Excel. Later, I learned that Office 2010 (or whatever it will be called) is under strict NDA which explains the lack of demos. The most interesting thing about that session was that I finally understood why the SSAS team decided to scale down the cube wizard in SSAS 2008 to generate basic dimensions only. The reason was performance. You see, the BIDS 2005 cube wizard would oftentimes suggest non-optimal dimension hierarchies and the modeler wouldn’t revise the design leading to bad performance.

Next, I attended the Thierry D’Hers Top Ten Reasons for Using Microsoft SQL Server 2008 Reporting Services. It’s always good to watch RS-related sessions. Thierry listed the community support as one of the reasons for organizations to consider moving to SSRS. I agree with this and the great community support is applicable to all MS technologies. Speaking about BI only, a few years ago there wasn’t a single book about Cognos for example. Granted, the last I look on Amazon there was one book. In comparison, Microsoft has a vibrant community of book authors, MVPs, trainers, etc. Almost every publisher has a book about SSRS 2008. This is of course good for the community and so good about authors as the competition is tough J

Thierry listed the Report Builder 2.0 as #1 reason to move to SSRS 2008, followed by data visualization, and tablix. Based on my real-life projects, I’d personally have listed them in the reverse order, with tablix being #1. This session officially announced that the map control will make to the Kilimanjaro release (now officially called SQL Server 2008 R2). Later on, Thierry was kind enough to show me a demo of the map pre-release bits. One of the data modes is using the SQL Server 2008 geospatial data types which would let you map any region in the world. Thierry showed a cool report showing the worldwide sales of SQL Server, where each country had a different color gradient based on the sales volume.

After lunch, I hang around the BI area in the learning center to answer questions and rub shoulders with Microsoft employees and peers. I was surprised to learn that there are only two Reporting Services-related sessions for the entire TechEd and one of them is mine! All of a sudden, I felt 2″ taller. Later on, I felt adventurous to learn something completely new and attended a SQL Server 2008 Failover Clustering only to realize how much I don’t know about it since I’ve never used it. BTW, there are great advancements in the SQL Server 2008 failover clustering, such as ability to upgrade or patch a cluster node without stopping it.

Transmissions from TechEd USA 2009 (Day 1)

Day 1 of TechEd 2009 is almost over with the exception of the Community Influencers Party tonight. I heard that this year they expect 7,000 attendees. This is a huge scale-down from previous years. For instance, we had 16,000 attendees at TechEd USA 2007. Economy is hitting everything hard.

I thought the keynote was kind of lame. Judging by it, Microsoft has only three products: Windows 7 (officially announced to ship around holidays although Microsoft didn’t say which holidays), Windows Server 2008 (the buzz is now the forthcoming R2 release), and Exchange Server 2010. Unlike previous TechEds, there wasn’t a single announcement about other products. SQL Server KJ, Office 2010, Azure, dev tools? Nope, apparently not worth mentioning. Sure Mark Russinovich, whom I respect very much, did some cool Windows 7 demos but there were not enough to pique my interest. I understand that OS and Exchange Server are bedrock for every business and after the sad Vista saga, we have to show the world that now we’ll do things right with Windows 7, but the BI soul in me was thirsty for more.

After lunch, I hang around the BI area of the Learning Center, where I answered questions and met with other peers, including Nick Barclay (MVP) whom I wanted to meet personally for a while. Then, I attended the excellent Donald Farmer and Kamal Hathi ‘s Microsoft Project Code Name “Gemini: Self Service Analysis and the Future of BI and I had the chance to see the Gemini, which I blogged briefly about before without knowing too much, for the first time in action and gain more in-depth knowledge.

The Gemini is an end-user oriented Excel add-in that will let the user acquire data from a variety of data sources, including SSRS reports (SSRS KJ will expose reports as data feeds) and SharePoint lists, and load them in an Excel spreadsheet. The tool crunches data very fast even on a modest computer (the demo showed a notebook computer working with millions of rows) thanks to its ability to compress column-level data. This works because a dataset column would typically contain redundant data.

Once data is loaded in Excel, the tool will attempt to automatically determine the relationships between datasets (loaded in separate spreadsheets) to create a hidden dimensional model consisting of fact and dimension in-memory tables. The user will be able to manually specify the dataset relationships by telling the tool which column will be used to join the datasets (very much like joining relational tables). Moreover, the user will be able to define calculated columns using Excel-style formulas. Finally, as the add-in builds behind the scenes an in-memory cube, the user will be able to slice and dice data in a Pivot table report. So, no Analysis Services is needed if all the user wants is manipulating data on the desktop.

Where things are getting more interesting is deploying the models on the server. To do so, the end user would deploy the Excel spreadsheet to the MOSS Report Library. Note that MOSS is required for server-side deployment. When other users request the spreadsheet, an Analysis Services redirector will understand that this is a Gemini model and service the requests from a server cube. At this point is not clear how exactly the server cube will be built and whether it could be managed in SSMS. Once the cube returns data, Excel Services will kick in to return data in HTML. A Reporting Services client can also connect to the server cube by its URL. This is no different than connection to a regular cube as Reporting Services will launch the familiar MDX query designer.

So, where is the IT in the new Gemini world? IT will use a cool MOSS dashboard to understand who’s deployed what model and how the models are used, such as when the datasets were refreshed, what are the most popular models, what resources these models took on the server, etc.

What’s my personal take on Gemini? It’s not up to me to decide how useful it is since it’s a business-oriented tool, such as Report Builder 2.0. Business users will have the final word. Based on my personal experience though, the data analytics problems that I need to solve with traditional Analysis Services cubes surpass the Gemini capabilities by far. So, don’t throw your MDX knowledge out of the door yet. In my line of work, I can see Gemini being useful as a cube prototyping tool, especially in the early stages of requirement gathering where data can be typed in Excel and I can demonstrate to users what a cube can do for them. Of course, Microsoft plans for Gemini are much more ambitious than that. In the ideal world, all business users would upgrade to Office 2010 and create cool Gemini models to give IT folks a long-deserved break ;-). Or, so the fairytale goes….

To wrap up the day, I attended What’s New in Microsoft SQL Data Services presentation by Rick Negrin to find out that SQL Data Services is nothing more than SQL Server running on Microsoft data centers. SQL Data Services will support two application connectivity modes: a “code near” mode where the application (typically a web application) is deployed to Azure and “code far”, where the application will connect to SQL Server over Internet using the TDS protocol. Microsoft role is to provide scalability and failover. Not all SQL Server features will be available in version 1. For example, CLR will not make the cut.

A long and tiring day. I am off to the party now.

Speaking at TechEd 2009 USA

One of my TechEd 2009 session proposals got approved. The session is tentatively named Reporting Services 2008 Tips and Tricks, How-to, and Beyond. I am planning to cover solutions to common questions and challenges that span the three phases of the reporting lifecycle: authoring, management and delivery. I hope you can attend my session If you planning to attend TechEd 2009 USA. Shoot me a note if you want me to cover something in particular. See you in LA!

My TechEd Session Scheduled

My breakout TechEd session BIN305 – Applied Microsoft SQL Server 2005 Reporting Services is scheduled for June 4 at 10:30 AM – 11:45 AM in room S320 A. That’s right – let’s talk reporting right after the keynote!

I hope to see some of you there.

Presenting at TechEd 2007

Microsoft has just let me know that my proposal for a TechEd 2007 breakout session has been approved. I’ve been honored to present Microsoft SQL Server Reporting Services for four consecutive years at TechEd. This time, I will share practical lessons and tips harvested from report-enabling a real-life application. Here is the formal session description (not yet on the TechEd website):

One of the most prominent feature of Microsoft Reporting Services is its open and extensible architecture that allows developers to report-enable easily their custom applications. Join this “from-the-trenches” session to learn how we leveraged Microsoft Reporting Services 2005 and SQL Server 2005 to add ad-hoc reporting features to our line-of-business application. Topics covered include:

  • Understanding the importance of the Report Definition Language (RDL)
  • Building an RDL object model to abstract access to RDL
  • Integrating the application middle tier with the Report Server web service
  • Implementing flexible data retrieval with SQL Server 2005 CLR stored procedures
  • Using the VS.NET 2005 Report Viewer control for report viewing
  • Caching reports for better performance and historical analysis
  • Securing access to the report catalog with Windows and basic security
  • Extending reports with custom code

 

See you in Orlando!

TechEd Memoirs

TechEd 2006 US is over. Given the fact that VS.NET 2005, .NET 2.0, and SQL Server 2005 have matured, there weren’t many earth-shuttering announcements and cool demos. Anyway, here are some of the most interesting BI highlights written in the plane on my way home.


Brian Welcker told us that RS SP2 (CTP scheduled for November, final release alongside Office 2007 launch) will bring a much tighter level of integration between RS 2005 and SharePoint 2007.  This is great given that SharePoint is the Microsoft premium portal solution. Specifically, it would be possible to configure RS in a SharePoint configuration (SSP) mode using the RS configuration utility. In SSP mode, the entire RS catalog is migrated to the SharePoint database. Instead of the Report Manager, the SharePoint Administration portal will be used for report management. For example, the Report Server will authenticate the users using the SSP security settings. The SSP Report Viewer (a wrapper around the ASP.NET Report Viewer) will make the report viewing experience more SharePoint-like. For example, the report parameters will be presented in a dropdown list to minimize the horizontal space.


Jamie MacLennan from the SSAS team did a great presentation about driving report payloads when UDM is used as a data source. What impressed me most was that Jamie gave us an honest account about some of the problematic integration areas instead of a let-me-show-you-how-cool-everything-we-did-is type of a presentation. For example, he pointed out that SSAS hierarchies don’t have Report Builder equivalents. Not only parent-child dimensions are not supported in the Report Builder, but they will give wrong results. He recommended avoiding parent-child hierarchies in the Report Builder whenever possible or replacing them with flattened regular hierarchies in UDM. Jamie shared great tips about associating a calculated member with a measure group so it appears in the Report Builder and disabling the automatic Report Builder drillthrough reports when UDM is used a data source for performance reasons. The biggest surprise for me came when he stated that Microsoft now recommends splitting a large UDM in smaller models that leverage the linked dimension and measure group features. As performance tests by the SSAS team have demonostrated, this approach may result in better memory utilization and performance, as well as easier management.


T.K. Anand and Dan Battagin did a great demo of SSAS-Excel 97 integration. No big surprises here if you follow the David Gainer’s blog. T.K. Anand and Christian Pesculescu took us behind the scenes of the UDM storage model and shared very interesting details about how dimension and measure are stored. Later, they did another excellent presentation that focused on maximizing the UDM performance. They emphasized the importance of designing proper attribute relationships and avoiding duplicate attribute relationships. It was mentioned that there will be a SSAS performance whitepaper in near future.


I was also given a personal sneak preview of the forthcoming PerformancePoint server. As I noted in a previous blog, PerformancePoint will unite three products — Scorecard Manager, ProClarity, and Biz#. I asked the presenter to focus on the Biz# portion only. It realized that the client interface will be implemented in Excel 2007. In fact, it seems like the Microsoft Office team expects people to build BI-centric applications inside the Office products after retiring OWC. The Performance Point client will support comprehensive models for planning and forecasting. Behind the scenes, these functions will be carried out by the server layer of PerformancePoint which creates SSAS cubes to save the model data.


Finally, I think my presentation (Building Reporting Solutions with SSRS and SSAS) went really well. Hilton Logan (the hotel I stayed) surprised us with a false fire alarm in the wee hours of the morning (around 3:30 am) and decided to evacuate everybody outside. So, I went on stage quite sleep-deprived since I couldn’t get more sleep after the accident (oh, yeah, we had an MVP party the previous night). The presentation took place in one of these vast conference rooms that make me proud that I am an American. It could easily take 500 people and it had four large screens. Despite the early hour (8 am), about 200 people showed up to boot with a healthy dose of Microsoft BI. I got a few interesting questions during the presentation and after it in the Business Intelligence discussion area.


As always, TechEd was a great show. Some 12,000 people attended the conference and it was sold out well in advance. I liked Boston too as I enjoy all cities where one can walk instead of sitting paralyzed in a traffic jam. Unfortunately, Atlanta is not one of them 🙁

Join me in the "hub"

I’ve just learned that I’ve been selected to present at TechEd 2006 US in Boston! I’ve been honored to present Reporting Services for three consecutive years at TechEd. This time, I will change things a bit and instead of focusing on the development area, I will talk about integrating Reporting Services and Analysis Services. As of now, the main topics I am planning to include are:


1. SSAS report actions
2. MDX Query Designer
3. Server aggregates
4. Building data mining reports
5. Ad-hoc reporting
6. CLR integration


See you in the “hub”!

SQL Pass 2005 slides and code

SQL Pass 2005 was a great show. It had about 2,000 attendees, which is great si nce you don’t feel disconnected in the TechEd crowd of 13,000 people. I did a “RS Tips, Tricks, and Gotchas” presentation with Bruce L-C whom I finally had a chance to meet. I think the presentation went very well. I’ve uploaded the slides and the sample code to my website. Among other things the code demonstrates:




  • Excel export scenarios


  • CSV export


  • External images streamed from a web service


  • Expression-based connection strings


  • CLR stored procedures


  • HTTP handler for intercepting the incoming traffic to the server


  • And more…

Happy reporting!

Tech Ed US (Day 1)

Everyone was rushing in the morning to see Steve Ballmer’s keynote. He said there are some 11,000 folks at the conference. No wonder, all ice creams would disappear even before arrival! Here is some statistics that got posted on the TechEd site:


 


Feeding the Masses What does it take to feed 13,000 people at Tech·Ed?



  • 117,000 bottles of water
  • 14,300 lattes
  • 15,600 ice cream bars (it’s HOT out there)
  • 4,000 pounds of chips and snack mix
  • 2,300 dozen eggs
  • 8,000 pounds of chicken
  • 52,000 servings of juice

You get dizzy if you stay more than 10 minutes in the exhibitor hall area, so I decided to avoid at all cost, or cross it as fast as I can. Otherwise, people will run over me in an endless pursuit to get every possible t-shirt that has been handed out. I got only two but I didn’t have to fight. They were given to me by the Analysis Services and Reporting Services team. The RS one is kind of cool, with a “Got spinny?” logo. Perhaps, I can sell it on e-Bay and make a fortune one day…


I expected Steve to do a few stunts on the stage, but I guess he wasn’t in the mood. Business Intelligence was everywhere in Steve’s speech. So, were mobile technologies. I start spotting a trend here. Every show would tout mobile and smart devices as the next pervasive technology. I’ve been waiting for years to find out if the “we are here yet”, but alas… Perhaps, 2005 will be the year of wireless, who knows.


In the afternoon, I went to see Jamie MacLennan’s (Product Manager of SSAS Data Mining) presentation about building intelligent application. He demonstrated how data mining can be used for data validation. Cool stuff! Who know, one day I may actually write such an application.