Datawarehouse loadings are periodic, and summary tables are useful to improve efficiency and the answer time of our reports. A physical optimisation resource that can bring big improvements in our system is the utilization of materialized views.
The view materialized is not more that a view, defined with one sentences SQL, of which in addition to storing his definition, store the data that returns, realizing an initial load and afterwards each true time a freshen of the same.
Like this, if we have a Datawarehouse that update daily, could use views materialized to go updating intermediate tables that feed our diagrams of DWH, or directly to implement tables agregadas that will freshen from our tables base.
The creation of this type of views no so complex as it can seem, the most important is to have clear each how much time want to update the information of the views, and which method of freshen to use.
Also will have to ensure us that our licence of base of data allows us use them (has to be a version Enterprise).
We have prepared virtual servers with a preconfigured installation of Pentaho, the well-known open source platform for Business Intelligence (BI Pentaho PCI).
You can freely download a virtual machine ready with the virtualization software VirtualBox 2.0 Sun, and another with VM 2.0 server software:
Already begin to see the results of the acquisition of Business Objects by part of SAP also for the products orientados to pymes.
Pentaho Has prepared with the version 1.7.1 A new demo online with some reports of proof for the one who want to explore interactivamente some of the analytical possibilities and of reporting of the suite.
The reports are prepared on the data of proof of the fictitious company Steel Wheels, already used by Pentaho for others give.
They have created a folder for each type of report:
Pentaho Dashboards Contains a picture
of interactive control integrated with Google Maps
, and another with a top
have of clients ordered by territory and linea of product
.
In order to be able to make all the steps you must initiate the session in the database with a user with administration permissions, you can use the SYSTEM user, for example
These tablespaces are the location where Oracle will save the objects of the scheme which we are going to create.
Tablespace for data, with initial size of 1024 Mb, and autoextensive
CREATE TABLESPACE "APPDAT" LOGGING DATAFILE '/export/home/oracle/oradata/datafiles/APPDAT.dbf' SIZE 1024M EXTENT MANAGEMENT LOCAL SEGMENT SPACE MANAGEMENT AUTO
CREATE TABLESPACE "APPIDX" LOGGING DATAFILE '/export/home/oracle/oradata/datafiles/APPIDX.dbf' SIZE 512M EXTENT MANAGEMENT LOCAL SEGMENT SPACE MANAGEMENT AUTO
CREATE USER “APP” PROFILE “DEFAULT” IDENTIFIED BY “APPPWD”
DEFAULT TABLESPACE “APPDAT” TEMPORARY TABLESPACE “TEMP” ACCOUNT UNLOCK;
Finally I could attend the Informatica World 2008, now I'll try to explain what I found there. The conference was held under the slogan 'Gain the Edge' from june 3 to 5.
First day began (after breakfast) with a general session entitled Vision. Strategy. Technology Announcements. Industry leadership. At the session both Sohaib Abbasi, President and CEO of Informática, as Chris Boorman, Ivan Chon and Girish Pancha, Vice-presidents in the Marketing, Data quality and Data integration areas respectively, showed their vision of current market, how technology and business are evolving and what role play in this context data and data-related aplicacions.
Much of what was familiar to me, because I had seen before at the Powerday 2008 Barcelona. This is an indicator that the company maintains a well-defined strategy, and shares with its partners.
Each speaker under the perspective of his area made particular emphasis on data value and the importance for each organization of the ability to maintain consistency and quality, ensure on time data availability, protection, sinchronization, and eficient data management and exchange with other organizations.
We noted that now having a Data Warehouse System that delivers new information every certain hour interval is insuficient. Internet and technological development, the globalization and our competitors are the reason. We heard repeatedly terms like SaaS, Real Time and Data Quality, clues about the new features of Powercenter and other tools of the company.
They also made an interesting demo about how an application like Salesforce.com can be synchronized in real time through the internet with a Google Docs spreadsheet. This cloud to to cloud computing example was made showing Salesforce.com on left screen and Google spreadsheet on right screen, each controlled by a laptop. The made a change in Salesforce.com data, and we saw how Google spreadsheet was automatically updated. Then they made a change in the spreadsheet and we saw how Salesforce.com data was also updated. Finally, they repeat the last part, but using an iPod Touch instead of the laptop. We must consider the opportunities that new mobile devices bring when we connect it to Internet.