Friday, 18 November 2011

What's next?


Had a lot of fun in this course, as it was one of the most challenging ones, in which I really had to sit down and think like a business analyst. Even though I've worked as a business analyst before, this was pretty different, considering that I wasn't building a straightforward reporting system, but a business intelligence prototype, with a lot of annoying real life like issues in the data, and data structure.

The readings provided were also thought provoking, and gave a deep insight on how much effort has been put into this field, and how there is still so much more to do, since there hasn't been much innovation done other than marketing innovation, reselling the same technologies, or engineering innovation, improving speed of the servers.


Now that gives me a lot of motivation to research in this field, as I love innovation, thinking outside the box, and bringing new things to the table.


For me, the next step is to complete the BI specialization with HDs in the next semester, by taking F
IT5094 IT for management decision making and FIT5095 Data warehousing.

And yes, I have 
to beat or at least equal Sindy's 98% in Data warehousing!!! :D

After that I'll apply for the
DSS Lab next year, and get me some precious experience. :)

Overall, I'm pretty set on my career path now, means I can fine tune my goals now.

Other than that, vacations are upon us! Time to make some goals for this long wonderful break. I've already printed out my unit guides for the next semester, and will be going to the library tomorrow to borrow the relevant books, especially the following:

  • The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling - Ralph Kimball
  • The Data Warehouse Lifecycle Toolkit - Ralph Kimball
  • Now You See It: Simple Visualization Techniques for Quantitative Analysis  - Stephen Few
  • Show Me the Numbers: Designing Tables and Graphs to Enlighten - Stephen Few

Thanks Steve and Sindy for a great semester, and all the knowledge you guys have given me. I really feel like I've grown to a new level as an analyst. :)

Now its time to enjoy the holidays, and learn new things!!


Tuesday, 15 November 2011

Week 12 Notes


Rise of Business Analytics

Business Analytics is the very similar to BI, and the technology has already been available to us for quite some time.
The difference is that BI provides more of a rear-view mirror approach, focusing on the data, what has been done and what is currently being done
Business Analytics focuses more on what can be done, like predictions, forecasts, how the data can be used usefully.
Even though the technology was available the organizations were not mature enough to ustilize this, plus it used to be a specialized feature, being pretty expensive, however, now the prices are getting much cheaper.

BI from ERP Systems

ERP companies are basically buying out BI companys, and bundling the ERP and solutions together, which is actually not too bad considering that the user gets a cheaper solution off the shelf.
However, the problem is that the application are based on the ERP design standards and requirements, which are quite different from BI ones
ERP requires speed of data entry, update and delete functionality, however, BI requires quick and intuitive access to data, with complex calculations, and attractive reports from various data sources.

BI on your mobile

The issues are:
  • lack of bandwidth
  • Desktop UI was basically squeezed onto a phone screen, not so attractive and useful
  • Most of the current usage is more towards operational levels than strategic.

Innovation

There is very little innovation, more like marketing innovation, reselling the same idea, and some engieering evolutions, making systems and servers faster, however, not much has been done on teh BI itself or the UI.

Overall BI is still profitable, even in an economic crisis, because of the perception of companies that it is an essential source of technology which allows them to do more with less.

Week 11 Notes


Lecture Objectives:
  • Explain the different types of numbers used in BI application user interfaces.
  • Correctly organize and display numbers into tables that suit how the numbers will be used.
  • Correctly organize and display numbers into charts (graphs) that suit how the numbers will be used.
  • Correctly organize tables and charts into reports and dashboard to effectively communication information to a business user.

Numbers


Can be divided mainly into three major groups:
  • Quantities and categories
    • Relationships within Categories
      • Nominal
      • Ordinal
      • Interval
      • Hierarchical
    • Relationships between Categories
      • Ranking
      • Correlation
      • Ratio
  • Numbers that summarise
    • Measure of Average
    • Measures of Distribution
  • Measures of money
    • due to Inflation
    • Currency conversion
Tables


Can be unidirectional or bidirectional.
Are used when we are presenting data in which need to look-up specific numbers, and there is more than a single unit of measure.

For good standardized table we follow the six Ehrenberg rules:
  1. Round the numbers to two significant figures.
  2. Use column and row averages to provide a focus point to the reader
  3. Use columns to allows users to compare figures.
  4. Order your columns and rows, either ascending or descending
  5. You can use space to separate columns, to increase space between things which are not being compared
  6. In case the numbers are important use a table, otherwise if shape is needed then a chart

Charts
A chart is used the shape of the data is important, and we want to show relationships between the  is the message not the numbers themselves. For example when we are trying to show the trend line
A few design rules are:
  • Avoid chart junk, by increasing the data–ink ratio.
  • Use small multiples to highlight comparisons and assess change
  • Use complex graphs to portray complex patterns
  • Relate graph size to information content
  • Use graphical forms that promote comparisons
  • Integrate graphs and text
  • Know the audience

Pie Charts
Should be avoided because:
1. Humans are not good at comparing areas and angles.
2. The segment and clock position are perceived differently, causing misinterpretation.

Reports and Dashboards

Reports are meant to be more static and less interactive, whereas Dashboards are dynamic and meant to be very interactive.

Standard headers, footers, fonts, number and currency formats should be used for reports.

Dashboards are more focused on providing monitoring and measuring information to the user at a glance, hence, we need to take take advantage of pre-attentive processing, to allow better intuitiveness.

Week 10 Notes

Lecture Objectives
  • Explain the role and importance of the user interface in BI applications
  • Explain impact individual user difference can have on the effectiveness of a BI application
  • Explain key principles of good interface usability and data visualization.

Intro
We have to understand that vendors try to sell very weird ideas, like UIs need to be very standardized so that the users don't need to "think" at all. This is so wrong, since we can never fully contemplate what the user wants, hence, the requirements need to come from the user through information gathering techniques like CSF / BSF.

Growing Problems
The problems that the industry is facing are:
  • more licenses have been sold that active users, meaning the BI apps are not being used
  • Excel and Access which gives users another way to access the data, bypassing the BI system
  • BI Analysts need to understand that they have to focus more on the UI, than just the technical back-end, like data modelling and integration.
The Nature of DSSBI systems are meant to be small, personalised focused system, which are able to perform urgent tasks quickly, at the speed of thought.
BI's integral theme is evolutionary, where the user uses the system, they gain knowledge and  understand it better, and along the way their perspective changes, and then the system needs to adapt otherwise it becomes irrelevant to the business is dropped.

Implications on a BI Analyst
  • they need to understand the decision making process and also how executive think
  • need powerful and flexible tools that allow us to quickly and easily make changes to the system based on user requests.
  • need a good infrastructure, both data warehouse and also very skilled and experienced staff members to efficiently work with the system.
  • need to win the trust of the users, in order to make them trust the data and the system itself.Decision makers will not use the system if they can't trust it. Trust means that the users can easily verify the output that they see.
Requirements of a good UI
  • Speed of interaction - user should be able perform operations at the speed of though, and not click and wait.
  • Interface overload - the UI should not be getting in the way of the analysis, and hence should be kept to a minimum.
  • Ease and convenience - users should not need to use manuals, the usage should be intuitive to allow users to quickly adapt.
  • Memorability - users should be able to recall easily how to use the system after being away from the system for sometime.

Week 6 notes

Thompson Diagrams:
Used for logical modelling of multi-dimensional structures in an OLAP system
Basics:


  • Each dimension is represented by a vertical line
  • Each dimension is described independently
  • Every member of a dimension is represented by a unit interval on the line
  • A multi-dimensional model is built by combining the ‘lines’ for the involved dimensions.





However, it's very difficult to show multiple hierarchies, which is why we use ADAPT diagrams.

ADAPT Diagrams:


ADAPT = Application Design for Analytical Processing Technologies

As discussed, it allows for multiple hierarchies, explicit modelling of attributes and also some complex calculations.

Very good from technical perspective, but maybe too complex and technical for business people, who may prefer Thompson's diagram.



COGNOS 24 Ways Approach:

This was pretty useful for me in the assignment, in sorting out which functional areas I can include, plus what kind of business process and KPIs do those functional areas follow.

The main assumption by Cognos is that organisations have a lot in common, and many dimensions are shared across functional and application boundaries. Given that data is available and is of good quality, attention more towards the business that the technical IT side.

Week 5 Notes

Main OLAP rules discussed:

OLAP Rule 1: Multidimensional Conceptual View
Multi-dimensional data models enable more straightforward and intuitive manipulation of data.

OLAP Rule 6: Generic Dimensionality
Every data dimension should be equivalent in its structure and operational capabilities

OLAP Rule 9: Unrestricted Cross-Dimensional Operations
Computational facilities must allow calculation and data manipulation across any number of data dimensions, and must not restrict any relationship between data cells.

OLAP Rule 10:Intuitive Data Manipulation
Data manipulation inherent in the consolidation path, such as drilling down or zooming out (i.e. drilling up), should be accomplished via direct action on the analytical model's cells, and not require use of a menu or multiple trips across the user interface.

Design Contraints:

There are two major constraint when designing your multi-dimensional database::
Business - the report requirements, and the representation of those reports, and moving from a spreadsheet to a multidimensional view
Data - the format, quality, source, and availability of data.


Important Definitions: (Microsoft BI: Making better decisions faster)


business intelligence (BI) An approach to management that allows an organization to define what information is useful and relevant to its corporate decision making. Business intelligence is a multifaceted concept that empowers organizations to make better decisions faster, convert data into information, and use a rational approach to management.


Dimensions: 

A dimension is a structural attribute of a cube that is a list of members, all of which are of a similar type in the user's perception of the data.

Measures: 
Measures are the numeric values that can be aggregated to give meaning to your dimensions.

calculated measure A measure that is calculated or derived from a combination of base measures.

Grain: 
The lowest level of drill-down is called 'grain', and is usually at the transaction level.

cube A multidimensional data structure that represents the intersections of each unique combination of dimensions. At each intersection there is a cell that contains a data value.

database A collection of related data that is organized in a useful manner for easy retrieval. There are different applications of databases depending on the type of data to be stored and how the data is to be used.


data warehouse A repository for data. Many experts define the data warehouse as a centralized data store that feeds data into a series of subject-specific data stores—called data marts. Others accept a broader definition of the data warehouse as a collection of integrated data marts.


desktop online analytical processing (DOLAP) An OLAP storage mode that keeps data on a client's machine and provides local multidimensional analysis.



hierarchy The organization of levels within a dimension that (1) reflects how data is aggregated from detailed levels to summarized levels and (2) serves as the drill-down path for top-down business analysis.


ragged hierarchy A hierarchy that has an inconsistent number of drill-down levels.



hybrid online analytical processing (HOLAP) An OLAP tool that can store data in both multidimensional databases and relational databases.


multidimensional online analytical processing (MOLAP) An OLAP storage mode in which data is placed into special structures that are stored on a central server(s).



online analytical processing (OLAP) Multidimensional analysis that is supported by interface tools and database structures that allow instantaneous access and easy user manipulation. Online analytical processing got its name because this name contrasts well with OLTP, a term that was already in widespread use when the term OLAP was created. There are fundamental differences between transaction processing and analytical processing. OLAP systems support multidimensional analysis at the speed of thought. OLAP typically follows the client/server paradigm, where an OLAP database server is accessed by many users who use multidimensional client tools to analyze data.


online transaction processing (OLTP) A data processing system designed to record all the business transactions of an organization as they occur. OLTP systems are structured for the purposes of running the day-to-day raw data of business, which requires efficiency and minute processing of transactions at the lowest level of detail. An OLTP system processes a transaction, performs all the elements of the transaction in real time, and processes many transactions on a continuous basis. OLTP systems usually offer little or no analytical capabilities.



relational online analytical processing (ROLAP) An OLAP storage mode where data is stored in relational databases.

uniform aggregation A method of summarizing data from its lowest level of detail to its highest level of detail, where data can be aggregated the same way across all dimensions.



semiadditive aggregation A method of summarizing data from its lowest level of detail to its highest level of detail in which measures are not aggregated uniformly across all dimensions.



slice and dice Two complementary methods for interacting with data. Slicing means isolating a specific member of a dimension for analysis. Dicing means breaking a data set into smaller pieces by examining how measures intersect with multiple dimensions.


***Memorize all of these!

Monday, 14 November 2011

Week 3 Notes


Lecture Objectives:

  • Understand the key issues related to five case studies of Australia BI system.
  • Be able to identify some of the key factors that can lead to the failure or the success of a BI system.

Keen's Adaptive Development Model


Overall Key Issues and Conclusions of Cases

  • Cost/Benefit Analysis - this needs to be done for the BI system that is going to be implement to see the feasibility
  • Evolution - the development strategy needs to be evolutionary, inline with the evolutionary application and function nature of BI applications.
  • Data and Data Management - the data and the way it is managed is critical to the success of the BI system, and the infrastructure needs to be flexible.
  • Profit Impact - the BI system can make a significant difference, however it depends on the organization how much profit the system can provide them. BI systems have very short spans of being relevant, after which the highly variant business/organizations change, and the BI system need to be adapted along the way.
  • Cost - need to consider the transaction costs, more than the operational cost of the BI system, considering its not about fixing problems and bugs, but more about a living breathing system, which has both functional and application evolution going on, as per the demands of the users.
  • Project Manager - needs to have experience in handling BI related projects, and also have very good technical, business, creative, and interpersonal skills, and should know how executives think and work.
  • Development team - the team needs to be good in development/programming/tools in general, however, its very important for them to be experienced in the BI environment, plus they should have good business sense, and data organization skills.
  • Politics  - need to be put aside for actual work to continue.
  • Contracts - should be loosely set to allow evolutionary development
  • Feedback - developers mainly complain that users don't provide feedback, which hinders BI application success
  • Commitment from SMs - is a very important CSF
  • Usage - the users are generally convinced about the benefits, however, there is still a lack of actual usage of the system due to lack of trust in the system.
  • System and Data Ownership -  considered a very good practice to separate these two.
  • Testing - generally developers avoid rigorous testing, which causes issues later on.