Microsoft SQL Server 2014 Business Intelligence Development: Beginners Guide
Article Index
Microsoft SQL Server 2014 Business Intelligence Development: Beginners Guide
Review continued; Conclusions

Author: Reza Rad
Publisher: Packt Publishing
Pages: 390
ISBN: 978-1849688888
Print: 1849688885
Kindle: B00KLAJ6OA
Audience: DBAs and developers
Rating: 4.6
Reviewer: Ian Stirk

 

Chapter 5 Master Data Management

Master Data Management (MDM) ensures only one copy of definitive data is kept. The chapter discusses the typical situation were various databases record similar information, but are often out-of-synch. Having a central database of master data removes this data inconsistency. Next, MDS is discussed, this provides features to enact MDM, including validation, security, business rules, and integration with DQS.

The chapter continues with practical exercises/walkthroughs for: 

 

  • Configuring MDS

  • Creating a model and entity

  • Change tracking

  • Creating a business rule (StandardCost > 0)

This chapter provides an overview of MDS which ensures definitive data is stored once only.

 

Chapter 6 Data Quality and Data Cleansing

Related data from different sources can introduce data quality issues. The chapter opens with a look at the need for high-quality data, to ensure algorithms match data better, leading to better decision making. Data problems discussed include: uniqueness, accuracy, and completeness.

Next, the chapter looks at Data Quality Services (DQS), a tool for keeping data clean, based on a Knowledge Base, where data stewards create and maintain policies. The various components of DQS (cleansing, matching, monitoring, profiling) are discussed.

The chapter continues with practical exercises/walkthroughs for: 

 

  • Installing DQS

  • Creating a Knowledge Base

  • Using Data Quality Project for cleansing

  • Integrating with MDS and SSIS

This chapter provides an overview of the importance of clean data, some of the tools and methods for keeping data clean, and the integration with MDS and SSIS.

 

Chapter 7 Data Mining – Descriptive Models in SSAS

Descriptive models analyse existing data, without making predictions. The chapter opens with an overview of data mining, an iterative cycle involving: problem definition, data preparation, data exploration, building and validating models, deploying and updating models. Next, 9 data mining algorithms are briefly explained, namely: Decision tree, Association rule, Clustering, Naive Bayes, Linear regression, Neural network, Logistic regression, Sequence clustering and Time series. Each algorithm has tuneable parameters to help get better results (e.g. set the number of clusters).

The chapter continues with a detailed example on the use of the decision tree algorithm. This identifies nodes and builds trees on those nodes, the nodes are based on decisions on a set of attributes (e.g. age over 40). A useful step-by-step example is given, which builds nodes and trees to identify the attributes that identify buyers of bicycles.

The chapter next provides another useful step-by-step example, this time using the association rules algorithm. This identifies relationships/rules between different variables in a dataset. The aim is to identify products commonly purchased together, which can be useful in cross-selling.

This chapter provides a useful and practical introduction to data mining with descriptive models in SSAS. However, some terms are used before being defined (e.g. training and test case). Also, it might have been useful to link it to the wider realm of Machine Learning and Big Data.

Chapter 8 Identifying Data Patterns – Predictive Models in SSAS

Here the focus moves to identifying data patterns to make predictions, allowing better business decisions. The chapter opens with a look at finding the most suitable algorithm for a dataset, luckily Microsoft provides tools for this (e.g. Lift Chart and Profit Chart) – which compare the results from the different mining models. The chapter provides a step-by-step walkthrough of finding which is the better model (Naive Bayes or Clustering) when targeting a mail shot.

The chapter continues with a look at using the Data Mining Extension (DMX) language to query and work with data mining algorithms. In the example given, a dataset of new customers is queried, to identify prospective bicycle buyers.

The chapter ends with another step-by-step example, using the Time Series mining model, useful for predicting time-based facts (e.g. expected sales in 18 months time).

This chapter provides a practical introduction to data mining with predictive models in SSAS.

Chapter 9 Reporting Services

SQL Server Reporting Services (SSRS) helps users visualize data as reports. The chapter opens with a high-level overview of the reporting services architecture, highlighting: SSRS databases, SSRS Web Applications, and SSRS Reports.

The chapter continues with practical exercises/walkthroughs for: 

 

  • Developing a basic report

  • Extended report development

  • Sorting and grouping

  • Deploying and configuring

 

This chapter provides a useful practical introduction to using SSRS, with helpful examples.

 

Chapter 10 Dashboard Design

Microsoft has various tools for visualization, including SSRS, PerformancePoint, and Excel. This chapter focuses on using PerformancePoint to build dashboards and charts, and Power View to build charts. These tools have the advantage they are more usable to business users.

The chapter provides an overview of PerformancePoint service (part of SharePoint), and Power View (part of SharePoint or Excel), before providing practical exercises, including:

 

  • PerformancePoint service

  • configuring PerformancePoint

  • creating your first dashboard

  • working with filters

  • Power View

  • enabling Power View in Excel

  • creating the first dashboard

  • visualizing time-based information with a scatter chart

 

In each case, a step-by-step walkthrough is provided, with useful instructions and screenshots.

This chapter provides a useful practical introduction to using both PerformancePoint and Power View in dashboard design, the examples should prove especially instructive.

Chapter 11 Power BI

Power BI enables capable business users to perform self-service BI. Power BI consists of various tools, Power View is the main component (discussed in the previous chapter), while this chapter discusses Power Query (self-service ETL tool) and Power Map (3D tool for geospatial data).

The chapter provides an overview of self-service ETL using Power Query, considered a light version of SSIS. A step-by-step example shows how to retrieve smartphone data from the web, apply transformations (e.g. adding extra columns), and combine this with other data sources.

The chapter continues with a look at Power Map, and provides a step-by-step example that produces visually pleasing 3D graphs of house information (e.g. price, size) in Dallas.

This chapter provides a useful introduction to the capabilities of Power Query and Power Map. It should be noted that Power BI components have many more features.

Chapter 12 Integrating Reports in Applications

This chapter outlines the various ways of embedding SSRS reports or other visualization components into .NET applications (i.e. Web, Windows, Metro). The main components described are ReportViewer and data mining models. In each example, a step-by-step walkthrough is provided, with useful instructions and screenshots.

This chapter provides an interesting overview of how SSRS reports or other visualization components can be integrated easily into the various types of .NET applications.

Conclusion

This book aims to explore Microsoft’s Business Intelligence (BI) tools, and succeeds. The book is generally easy to read, with useful explanations, practical step-by-step walkthroughs, helpful screenshots, inter-chapter references, and website links.

There is much more to learn about Microsoft’s BI tools, many of the components discussed here have many other options that have not been discussed, but the book does provide a useful entry point to the most commonly used BI components.

The book does require some previous knowledge of databases systems, so it’s not a guide for the complete beginner - contrary to its title. Sometimes, insufficient information is provided before an example walkthrough (i.e. why we do something), however, useful information is provided after the walkthrough. There are a few areas of bad grammar but nothing too onerous.

It might have been useful to include some comment on the growing importance of Big Data, and Machine Learning. I do wonder if they might lead to the decline of proprietary data warehouses and BI tools in the near future.

If you have some knowledge of database systems (OLAP in particular), this book provides a great entry point to learning Microsoft’s BI tools. Recommended.

Banner


Code: The Hidden Language of Computer Hardware and Software 2nd Ed

Top Book 2023
Author: Charles Petzold
Publisher: Microsoft Press
Date: August 2022
Pages: 480
ISBN: 978-0137909100
Print: 0137909101
Kindle: B0B123P5GV
Audience: General
Rating: 5
Reviewer: Mike James
Code! We all need to know about it.



Expert Performance Indexing in Azure SQL and SQL Server 2022

Author: Edward Pollack & Jason Strate
Publisher: Apress
Pages: 659
ISBN: 9781484292143
Print: 1484292146
Kindle: B0BSWH65ST
Audience: DBAs & SQL devs
Rating: 4 or 1 (see review)
Reviewer: Ian Stirk 

This book discusses indexes, a primary means of improving performance in SQL Server, how does  [ ... ]


More Reviews

 

 



Last Updated ( Wednesday, 06 April 2016 )