<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-1305275264264675951</id><updated>2012-02-15T22:51:31.457-08:00</updated><category term='Transition to Production'/><category term='Business Question Assessment'/><category term='Real time Stuff'/><category term='Project Desc_2'/><category term='Business Case Development'/><category term='Concepts-7'/><category term='Implementation'/><category term='The Data Warehouse Process'/><category term='Metadata Management'/><category term='Tool Selection'/><category term='Data Warehouse Configurations'/><category term='NEW EVOLUTIONS'/><category term='Concepts-3'/><category term='Project Desc_1'/><category term='Concepts-5'/><category term='Concepts-1'/><category term='OLAP Analysis'/><category term='Star Schema Modelling'/><category term='ETL Process Flow'/><category term='OLAP - Examples'/><category term='Detail Design'/><category term='Concepts-6'/><category term='Iteration Project Planning'/><category term='What are ETL Tools?'/><category term='Concepts-4'/><category term='Concepts-2'/><category term='OLAP n its Hybrids'/><category term='Data Profiling'/><category term='OLAP Database'/><category term='What to Learn? : ETL Tools'/><category term='XML ETL Processing'/><category term='Data Quality'/><category term='Architecture Review and Design'/><title type='text'>Datawarehousing Tools</title><subtitle type='html'>Datawarehousing Concepts,Data Warehouse,Data Mart,Star Schema,Snowflake Schema,Fact Table,Database overview, Database Objects,Database Sample Data,SQL,PLSQL, ETL Tools,ETL Concepts,ETL tools-informatica,Transformations,Data Stage, Cognos, Business Objects, Abnitio, Interview questions, Real time Tickets, Faqs with Examples, ANd more and more,,,,</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>33</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-4860377953676094160</id><published>2009-11-02T08:18:00.000-08:00</published><updated>2009-11-02T08:48:41.884-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='NEW EVOLUTIONS'/><title type='text'></title><content type='html'>&lt;div align="left"&gt;&lt;strong&gt;&lt;span style="color:#000099;"&gt;New Evolutions&lt;/span&gt;&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;Many vendors in the traditional data consolidation market are positioning their products as either Extract-Transform-Load (ETL), Extract-Load-Transform (ELT), or maybe even Transform-Extract-Load (TEL) tools. Each vendor naturally touts the strengths of their adopted approach, and highlights the weaknesses inherent in those of their competitors.&lt;br /&gt;&lt;br /&gt;So, &lt;strong&gt;which approach is best?&lt;/strong&gt; The truth is that all approaches have their strengths and weaknesses, and it is likely that most organizations will find a need to use a combination of all of these techniques. Therefore, the real key to the alphabet soup of ETL vs. ELT vs. TEL, is flexibility and the ability to support the technique that best suits the job at hand. Molding a data flow that fits well into an ETL architecture into an ELT architecture, just because the tool lacks the ability to adequately support one process or the other, is a recipe for disaster.&lt;br /&gt;&lt;br /&gt;WebSphere DataStage is an inherently flexible data consolidation tool that can natively support ETL, ELT, and TEL topologies. This article shows how the combination of WebSphere DataStage and WebSphere Federation Server can extend the alphabet soup by effectively supporting Transform-Extract-Transform-Load (T-ETL) data consolidation topologies. Within a T-ETL topology, WebSphere DataStage and WebSphere Federation Server complement each other in such a manner that significant performance benefits and CPU savings can be achieved relative to using WebSphere DataStage alone.&lt;br /&gt;&lt;br /&gt;In this scenario, WebSphere Federation Server is able to perform processing close to the input sources so that less data is presented to the extraction stage, and less transformation needs to be done by WebSphere DataStage. This benefit is achieved because the T-ETL architecture plays exactly to the strengths of both products; WebSphere Federation Server for its cost-based optimizer and set processing efficiency in a heterogeneous environment, and WebSphere DataStage for its powerful parallel transformation and data flow engine.&lt;br /&gt;&lt;br /&gt;The following section of this article provides a brief introduction to WebSphere Federation Server before describing the T-ETL architecture in more detail. The subsequent use-case scenarios sections detail four different cases that highlight the benefits of T-ETL. The general traits of WebSphere DataStage jobs that are likely to benefit from this architecture are then summarized.&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;&lt;span style="color:#000099;"&gt;Before&lt;/span&gt;&lt;/strong&gt; &lt;strong&gt;&lt;span style="color:#660000;"&gt;&lt;a href="http://datastagetool.blogspot.com/2009/11/websphere-federation-server-compensate.html"&gt;Next&lt;/a&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/div&gt;&lt;div align="left"&gt; &lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-4860377953676094160?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/4860377953676094160/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=4860377953676094160' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/4860377953676094160'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/4860377953676094160'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2009/11/many-vendors-in-traditional-data.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-1749611448296188234</id><published>2008-08-27T00:33:00.000-07:00</published><updated>2008-08-27T00:36:33.977-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Transition to Production'/><title type='text'></title><content type='html'>&lt;span style="color:#990000;"&gt;&lt;span style="font-size:130%;"&gt;&lt;strong&gt;Transition to Production&lt;/strong&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;The Transition to Production stage moves the Data Warehouse development project into the production environment. The production database is created, and the extraction/cleanse/transformation routines are run on the operations system source data.&lt;br /&gt;&lt;br /&gt;The development team works with the Operations staff to perform the initial load of this data to the Warehouse and execute the first refresh cycle.&lt;br /&gt;&lt;br /&gt;The Operations staff is trained, and the Data Warehouse programs and processes are moved into the production libraries and catalogs.&lt;br /&gt;&lt;br /&gt;Rollout presentations and tool demonstrations are given to the entire customer community, and end-user training is scheduled and conducted. The Help Desk is established and put into operation. A Service Level Agreement is developed and approved by the customer organization.&lt;br /&gt;&lt;br /&gt;Finally, the new system is positioned for ongoing maintenance through the establishment of a Change Management Board and the implementation of change control procedures for future development cycles.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-1749611448296188234?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/1749611448296188234/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=1749611448296188234' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/1749611448296188234'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/1749611448296188234'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/08/transition-to-production-transition-to.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-8026056554300662255</id><published>2008-08-27T00:31:00.000-07:00</published><updated>2008-08-27T00:32:41.809-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Implementation'/><title type='text'></title><content type='html'>&lt;span style="color:#990000;"&gt;&lt;span style="font-size:100%;"&gt;&lt;strong&gt;Implementation&lt;/strong&gt; &lt;/span&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;Once the Planning and Design stages are complete, the project to implement the current Data Warehouse iteration can proceed quickly. Necessary hardware, software and middleware components are purchased and installed, the development and test environment is established, and the configuration management processes are implemented.&lt;br /&gt;&lt;br /&gt;Programs are developed to extract, cleanse, transform and load the source data and to periodically refresh the existing data in the Warehouse, and the programs are individually unit tested against a test database with sample source data. Metrics are captured for the load process.&lt;br /&gt;&lt;br /&gt;The metadata repository is loaded with transformational and business user metadata. Canned production reports are developed and sample ad-hoc queries are run against the test database, and the validity of the output is measured. User access to the data in the Warehouse is established.&lt;br /&gt;&lt;br /&gt;Once the programs have been developed and unit tested and the components are in place, system functionality and user acceptance testing is conducted for the complete integrated Data Warehouse system. System support processes of database security, system backup and recovery, system disaster recovery, and data archiving are implemented and tested as the system is prepared for deployment.&lt;br /&gt;&lt;br /&gt;The final step is to conduct the Production Readiness Review prior to transitioning the Data Warehouse system into production. During this review, the system is evaluated for acceptance by the customer organization.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-8026056554300662255?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/8026056554300662255/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=8026056554300662255' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/8026056554300662255'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/8026056554300662255'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/08/implementation-once-planning-and-design.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-1523162983108590956</id><published>2008-08-27T00:29:00.002-07:00</published><updated>2008-08-27T00:30:26.322-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Detail Design'/><title type='text'></title><content type='html'>&lt;strong&gt;&lt;span style="color:#990000;"&gt;Detail Design&lt;/span&gt;&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;In the Detail Design stage, the physical Data Warehouse model (database schema) is developed, the metadata is defined, and the source data inventory is updated and expanded to include all of the necessary information needed for the subject area implementation project, and is validated with users. Finally, the detailed design of all procedures for the implementation project is completed and documented.&lt;br /&gt;&lt;br /&gt;Procedures to achieve the following activities are designed:&lt;br /&gt;&lt;br /&gt;- Warehouse Capacity Growth&lt;br /&gt;&lt;br /&gt;- Data Extraction/Transformation/Cleansing&lt;br /&gt;&lt;br /&gt;- Data Load&lt;br /&gt;&lt;br /&gt;- Security&lt;br /&gt;&lt;br /&gt;- Data Refresh&lt;br /&gt;&lt;br /&gt;- Data Access&lt;br /&gt;&lt;br /&gt;- Backup and Recovery&lt;br /&gt;&lt;br /&gt;- Disaster Recovery&lt;br /&gt;&lt;br /&gt;- Data Archiving&lt;br /&gt;&lt;br /&gt;- Configuration Management&lt;br /&gt;&lt;br /&gt;- Testing&lt;br /&gt;&lt;br /&gt;- Transition to Production&lt;br /&gt;&lt;br /&gt;- User Training&lt;br /&gt;&lt;br /&gt;- Help Desk&lt;br /&gt;&lt;br /&gt;- Change Management&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-1523162983108590956?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/1523162983108590956/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=1523162983108590956' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/1523162983108590956'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/1523162983108590956'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/08/detail-design-in-detail-design-stage.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-972104801025826848</id><published>2008-08-27T00:28:00.001-07:00</published><updated>2008-08-27T00:28:51.114-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Iteration Project Planning'/><title type='text'></title><content type='html'>&lt;strong&gt;&lt;span style="color:#990000;"&gt;Iteration Project Planning&lt;/span&gt;&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;The Data Warehouse is implemented (populated) one subject area at a time, driven by specific business questions to be answered by each implementation cycle. The first and subsequent implementation cycles of the Data Warehouse are determined during the BQA stage. At this point in the Process the first (or next if not first) subject area implementation project is planned.&lt;br /&gt;&lt;br /&gt;The business requirements discovered in BQA and, to a lesser extent, the technical requirements of the Architecture Design stage are now refined through user interviews and focus sessions to the subject area level. The results are further analyzed to yield the detail needed to design and implement a single population project, whether initial or follow-on.&lt;br /&gt;&lt;br /&gt;The Data Warehouse project team is expanded to include the members needed to construct and deploy the Warehouse, and a detailed work plan for the design and implementation of the iteration project is developed and presented to the customer organization for approval.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-972104801025826848?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/972104801025826848/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=972104801025826848' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/972104801025826848'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/972104801025826848'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/08/iteration-project-planning-data.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-858456453821603380</id><published>2008-08-27T00:27:00.001-07:00</published><updated>2008-08-27T00:27:57.663-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Tool Selection'/><title type='text'></title><content type='html'>&lt;strong&gt;&lt;span style="color:#990000;"&gt;Tool Selection&lt;/span&gt;&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;The purpose of this stage is to identify the candidate tools for developing and implementing the Data Warehouse data and application architectures and for performing technical and support architecture functions where appropriate. Select the candidate tools that best meet the business and technical requirements as defined by the Data Warehouse architecture, and recommend the selections to the customer organization. Procure the tools upon approval from the organization.&lt;br /&gt;&lt;br /&gt;It is important to note that the process of selecting tools is often dependent on the existing technical infrastructure of the organization. Many organizations feel strongly for various reasons about using tools for the Data Warehouse applications that they already have in their "arsenal" and are reluctant to purchase new application packages. It is recommended that a thorough evaluation of existing tools and the feasibility of their reuse be done in the context of all tool evaluation activities. In some cases, existing tools can be form-fitted to the Data Warehouse; in other cases, the customer organization may need to be convinced that new tools would better serve their needs.&lt;br /&gt;&lt;br /&gt;It may even be feasible that this series of activities is skipped altogether, if the organization is insistent that particular tools be used (no room for negotiation), or if tools have already been assessed and selected in anticipation of the Data Warehouse project.&lt;br /&gt;&lt;br /&gt;&lt;span style="color:#000099;"&gt;&lt;strong&gt;Tools may be categorized according to the following data, technical, application, or support functions:&lt;/strong&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;- Source Data Extraction and Transformation&lt;br /&gt;&lt;br /&gt;- Data Cleansing&lt;br /&gt;&lt;br /&gt;- Data Load&lt;br /&gt;&lt;br /&gt;- Data Refresh&lt;br /&gt;&lt;br /&gt;- Data Access&lt;br /&gt;&lt;br /&gt;- Security Enforcement&lt;br /&gt;&lt;br /&gt;- Version Control/Configuration Management&lt;br /&gt;&lt;br /&gt;- Backup and Recovery&lt;br /&gt;&lt;br /&gt;- Disaster Recovery&lt;br /&gt;&lt;br /&gt;- Performance Monitoring&lt;br /&gt;&lt;br /&gt;- Database Management&lt;br /&gt;&lt;br /&gt;- Platform&lt;br /&gt;&lt;br /&gt;- Data Modeling&lt;br /&gt;&lt;br /&gt;- Metadata Management&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-858456453821603380?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/858456453821603380/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=858456453821603380' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/858456453821603380'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/858456453821603380'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/08/tool-selection-purpose-of-this-stage-is.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-9080842886057813946</id><published>2008-08-27T00:25:00.000-07:00</published><updated>2008-08-27T00:26:46.788-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Architecture Review and Design'/><title type='text'></title><content type='html'>&lt;strong&gt;&lt;span style="color:#990000;"&gt;Architecture Review and Design&lt;/span&gt;&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;The Architecture is the logical and physical foundation on which the Data Warehouse will be built. The Architecture Review and Design stage, as the name implies, is both a requirements analysis and a gap analysis activity. It is important to assess what pieces of the architecture already exist in the organization (and in what form) and to assess what pieces are missing which are needed to build the complete Data Warehouse architecture.&lt;br /&gt;&lt;br /&gt;During the Architecture Review and Design stage, the logical Data Warehouse architecture is developed. The logical architecture is a configuration map of the necessary data stores that make up the Warehouse; it includes a central Enterprise Data Store, an optional Operational Data Store, one or more (optional) individual business area Data Marts, and one or more Metadata stores. In the metadata store(s) are two different kinds of metadata that catalog reference information about the primary data.&lt;br /&gt;&lt;br /&gt;Once the logical configuration is defined, the Data, Application, Technical and Support Architectures are designed to physically implement it. Requirements of these four architectures are carefully analyzed so that the Data Warehouse can be optimized to serve the users. Gap analysis is conducted to determine which components of each architecture already exist in the organization and can be reused, and which components must be developed (or purchased) and configured for the Data Warehouse.&lt;br /&gt;&lt;br /&gt;The Data Architecture organizes the sources and stores of business information and defines the quality and management standards for data and metadata.&lt;br /&gt;&lt;br /&gt;The Application Architecture is the software framework that guides the overall implementation of business functionality within the Warehouse environment; it controls the movement of data from source to user, including the functions of data extraction, data cleansing, data transformation, data loading, data refresh, and data access (reporting, querying).&lt;br /&gt;&lt;br /&gt;The Technical Architecture provides the underlying computing infrastructure that enables the data and application architectures. It includes platform/server, network, communications and connectivity hardware/software/middleware, DBMS, client/server 2-tier vs.3-tier approach, and end-user workstation hardware/software. Technical architecture design must address the requirements of scalability, capacity and volume handling (including sizing and partitioning of tables), performance, availability, stability, chargeback, and security.&lt;br /&gt;&lt;br /&gt;The Support Architecture includes the software components (e.g., tools and structures for backup/recovery, disaster recovery, performance monitoring, reliability/stability compliance reporting, data archiving, and version control/configuration management) and organizational functions necessary to effectively manage the technology investment.&lt;br /&gt;&lt;br /&gt;Architecture Review and Design applies to the long-term strategy for development and refinement of the overall Data Warehouse, and is not conducted merely for a single iteration. This stage develops the blueprint of an encompassing data and technical structure, software application configuration, and organizational support structure for the Warehouse. It forms a foundation that drives the iterative Detail Design activities. Where Design tells you what to do; Architecture Review and Design tells you what pieces you need in order to do it.&lt;br /&gt;&lt;br /&gt;The Architecture Review and Design stage can be conducted as a separate project that runs mostly in parallel with the Business Question Assessment stage. For the technical, data, application and support infrastructure that enables and supports the storage and access of information is generally independent from the business requirements of which data is needed to drive the Warehouse. However, the data architecture is dependent on receiving input from certain BQA activities (data source system identification and data modeling), so the BQA stage must conclude before the Architecture stage can conclude.&lt;br /&gt;&lt;br /&gt;The Architecture will be developed based on the organization's long-term Data Warehouse strategy, so that future iterations of the Warehouse will have been provided for and will fit within the overall architecture.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-9080842886057813946?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/9080842886057813946/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=9080842886057813946' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/9080842886057813946'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/9080842886057813946'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/08/architecture-review-and-design.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-364755925897694660</id><published>2008-08-27T00:24:00.000-07:00</published><updated>2008-08-27T00:25:20.540-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Business Question Assessment'/><title type='text'></title><content type='html'>&lt;span style="color:#990000;"&gt;&lt;strong&gt;Business Question Assessment&lt;/strong&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;Once a Business Case has been developed, the short-term strategy for implementing the Data Warehouse is mapped out by means of the Business Question Assessment (BQA) stage. The purpose of BQA is to:&lt;br /&gt;&lt;br /&gt;- Establish the scope of the Warehouse and its intended use&lt;br /&gt;&lt;br /&gt;- Define and prioritize the business requirements and the subsequent information (data) needs the Warehouse will address&lt;br /&gt;&lt;br /&gt;- Identify the business directions and objectives that may influence the required data and application architectures&lt;br /&gt;&lt;br /&gt;- Determine which business subject areas provide the most needed information; prioritize and sequence implementation projects accordingly&lt;br /&gt;&lt;br /&gt;- Drive out the logical data model that will direct the physical implementation model&lt;br /&gt;&lt;br /&gt;- Measure the quality, availability, and related costs of needed source data at a high level&lt;br /&gt;&lt;br /&gt;- Define the iterative population projects based on business needs and data validation&lt;br /&gt;&lt;br /&gt;The prioritized predator value stream or most important strategic initiative is analyzed to determine the specific business questions that need to be answered through a Warehouse implementation. Each business question is assessed to determine its overall importance to the organization, and a high-level analysis of the data needed to provide the answers is undertaken.&lt;br /&gt;&lt;br /&gt;The data is assessed for quality, availability, and cost associated with bringing it into the Data Warehouse. The business questions are then revisited and prioritized based upon their relative importance and the cost and feasibility of acquiring the associated data.&lt;br /&gt;&lt;br /&gt;The prioritized list of business questions is used to determine the scope of the first and subsequent iterations of the Data Warehouse, in the form of population projects. Iteration scoping is dependent on source data acquisition issues and is guided by determining how many business questions can be answered in a three to six month implementation time frame.&lt;br /&gt;&lt;br /&gt;A "business question" is a question deemed by the business to provide useful information in determining strategic direction. A business question can be answered through objective analysis of the data that is available.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-364755925897694660?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/364755925897694660/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=364755925897694660' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/364755925897694660'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/364755925897694660'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/08/business-question-assessment-once.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-132808458784037156</id><published>2008-08-27T00:23:00.001-07:00</published><updated>2008-08-27T00:23:56.552-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Business Case Development'/><title type='text'></title><content type='html'>&lt;span style="color:#990000;"&gt;&lt;strong&gt;Business Case Development&lt;/strong&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;A variety of kinds of strategic analysis, including Value Stream Assessment, have likely already been done by the customer organization at the point when it is necessary to develop a Business Case. The Business Case Development stage launches the Data Warehouse development in response to previously identified strategic business initiatives and "predator" (key) value streams of the organization.&lt;br /&gt;&lt;br /&gt;The organization will likely have identified more than one important value stream. In the long term it is possible to implement Data Warehouse solutions that address multiple value streams, but it is the predator value stream or highest priority strategic initiative that usually becomes the focus of the short-term strategy and first run population projects resulting in a Data Warehouse.&lt;br /&gt;&lt;br /&gt;At the conclusion of the relevant business reengineering, strategic visioning, and/or value stream assessment activities conducted by the organization, a Business Case can be built to justify the use of the Data Warehouse architecture and implementation approach to solve key business issues directed at the most important goals. The Business Case defines the outlying activities, costs, benefits, and critical success factors for a multi-generation implementation plan that results in a Data Warehouse framework of an information storage/access system. The Warehouse is an iterative designed/developed/refined solution to the tactical and strategic business requirements.&lt;br /&gt;&lt;br /&gt;The Business Case addresses both the short-term and long-term Warehouse strategies (how multiple data stores will work together to fulfill primary and secondary business goals) and identifies both immediate and extended costs so that the organization is better able to plan its short and long-term budget appropriation.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-132808458784037156?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/132808458784037156/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=132808458784037156' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/132808458784037156'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/132808458784037156'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/08/business-case-development-variety-of.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-5316963101149282077</id><published>2008-08-27T00:22:00.001-07:00</published><updated>2008-08-27T00:22:50.462-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='The Data Warehouse Process'/><title type='text'></title><content type='html'>&lt;strong&gt;&lt;span style="color:#990000;"&gt;The Data Warehouse Process&lt;/span&gt;&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;The james martin + co Data Warehouse Process does not encompass the analysis and identification of organizational value streams, strategic initiatives, and related business goals, but it is a prescription for achieving such goals through a specific architecture.&lt;br /&gt;&lt;br /&gt;The Process is conducted in an iterative fashion after the initial business requirements and architectural foundations have been developed with the emphasis on populating the Data Warehouse with "chunks" of functional subject-area information each iteration. The Process guides the development team through identifying the business requirements, developing the business plan and Warehouse solution to business requirements, and implementing the configuration, technical, and application architecture for the overall Data Warehouse. It then specifies the iterative activities for the cyclical planning, design, construction, and deployment of each population project.&lt;br /&gt;&lt;br /&gt;The following is a description of each stage in the Data Warehouse Process. (Note: The Data Warehouse Process also includes conventional project management, startup, and wrap-up activities which are detailed in the Plan, Activate, Control and End stages, not described here.)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-5316963101149282077?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/5316963101149282077/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=5316963101149282077' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/5316963101149282077'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/5316963101149282077'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/08/data-warehouse-process-james-martin-co.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-1347133954654191336</id><published>2008-08-27T00:20:00.000-07:00</published><updated>2008-08-27T00:21:49.664-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Data Warehouse Configurations'/><title type='text'></title><content type='html'>&lt;strong&gt;&lt;span style="color:#660000;"&gt;Data Warehouse Configurations&lt;/span&gt;&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;A Data Warehouse configuration, also known as the logical architecture, includes the following components:&lt;br /&gt;&lt;br /&gt;- one Enterprise Data Store (EDS) - a central repository which supplies atomic (detail level) integrated information to the whole organization.&lt;br /&gt;&lt;br /&gt;- (optional) one Operational Data Store - a "snapshot" of a moment in time's enterprise-wide data&lt;br /&gt;&lt;br /&gt;- (optional) one or more individual Data Mart(s) - summarized subset of the enterprise's data specific to a functional area or department, geographical region, or time period&lt;br /&gt;&lt;br /&gt;- one or more Metadata Store(s) or Repository(ies) - catalog(s) of reference information about the primary data. Metadata is divided into two categories: information for technical use, and information for business end-users.&lt;br /&gt;&lt;br /&gt;The EDS is the cornerstone of the Data Warehouse. It can be accessed for both immediate informational needs and for analytical processing in support of strategic decision making, and can be used for drill-down support for the Data Marts which contain only summarized data. It is fed by the existing subject area operational systems and may also contain data from external sources. The EDS in turn feeds individual Data Marts that are accessed by end-user query tools at the user's desktop. It is used to consolidate related data from multiple sources into a single source, while the Data Marts are used to physically distribute the consolidated data into logical categories of data, such as business functional departments or geographical regions. The EDS is a collection of daily "snapshots" of enterprise-wide data taken over an extended time period, and thus retains and makes available for tracking purposes the history of changes to a given data element over time. This creates an optimum environment for strategic analysis. However, access to the EDS can be slow, due to the volume of data it contains, which is a good reason for using Data Marts to filter, condense and summarize information for specific business areas. In the absence of the Data Mart layer, users can access the EDS directly.&lt;br /&gt;&lt;br /&gt;Metadata is "data about data," a catalog of information about the primary data that defines access to the Warehouse. It is the key to providing users and developers with a road map to the information in the Warehouse. Metadata comes in two different forms: end-user and transformational. End-user metadata serves a business purpose; it translates a cryptic name code that represents a data element into a meaningful description of the data element so that end-users can recognize and use the data. For example, metadata would clarify that the data element "ACCT_CD" represents "Account Code for Small Business." Transformational metadata serves a technical purpose for development and maintenance of the Warehouse. It maps the data element from its source system to the Data Warehouse, identifying it by source field name, destination field code, transformation routine, business rules for usage and derivation, format, key, size, index and other relevant transformational and structural information. Each type of metadata is kept in one or more repositories that service the Enterprise Data Store.&lt;br /&gt;&lt;br /&gt;While an Enterprise Data Store and Metadata Store(s) are always included in a sound Data Warehouse design, the specific number of Data Marts (if any) and the need for an Operational Data Store are judgment calls. Potential Data Warehouse configurations should be evaluated and a logical architecture determined according to business requirements.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-1347133954654191336?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/1347133954654191336/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=1347133954654191336' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/1347133954654191336'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/1347133954654191336'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/08/data-warehouse-configurations-data.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-42033212241252512</id><published>2008-08-27T00:13:00.000-07:00</published><updated>2008-08-27T00:20:48.486-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Project Desc_2'/><title type='text'></title><content type='html'>&lt;strong&gt;Data Warehouses&lt;/strong&gt; can be defined as subject-oriented, integrated, time-variant, non-volatile collections of data used to support analytical decision making. The data in the Warehouse comes from the operational environment and external sources. Data Warehouses are physically separated from operational systems, even though the operational systems feed the Warehouse with source data.&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;&lt;span style="color:#000099;"&gt;Subject Orientation&lt;/span&gt;&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;Data Warehouses are designed around the major subject areas of the enterprise; the operational environment is designed around applications and functions. This difference in orientation (data vs. process) is evident in the content of the database. Data Warehouses do not contain information that will not be used for informational or analytical processing; operational databases contain detailed data that is needed to satisfy processing requirements but which has no relevance to management or analysis.&lt;br /&gt;&lt;br /&gt;&lt;span style="color:#000099;"&gt;&lt;strong&gt;Integration and Transformation&lt;/strong&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;The data within the Data Warehouse is integrated. This means that there is consistency among naming conventions, measurements of variables, encoding structures, physical attributes, and other salient data characteristics. An example of this integration is the treatment of codes such as gender codes. Within a single corporation, various applications may represent gender codes in different ways: male vs. female, m vs. f, and 1 vs. 0, etc. In the Data Warehouse, gender is always represented in a consistent way, regardless of the many ways by which it may be encoded and stored in the source data. As the data is moved to the Warehouse, it is transformed into a consistent representation as required.&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;&lt;span style="color:#000099;"&gt;Time Variance&lt;/span&gt;&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;All data in Data Warehouse is accurate as of some moment in time, providing an historical perspective. This differs from the operational environment in which data is intended to be accurate as of the moment of access. The data in the Data Warehouse is, in effect, a series of snapshots. Once the data is loaded into the enterprise data store and data marts, it cannot be updated. It is refreshed on a periodic basis, as determined by the business need. The operational data store, if included in the Warehouse architecture, may be updated.&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;&lt;span style="color:#000099;"&gt;Non-Volatility&lt;/span&gt;&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;Data in the Warehouse is static, not dynamic. The only operations that occur in Data Warehouse applications are the initial loading of data, access of data, and refresh of data. For these reasons, the physical design of a Data Warehouse optimizes the access of data, rather than focusing on the requirements of data update and delete processing.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-42033212241252512?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/42033212241252512/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=42033212241252512' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/42033212241252512'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/42033212241252512'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/08/data-warehouses-can-be-defined-as.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-314101000239986612</id><published>2008-08-27T00:01:00.000-07:00</published><updated>2008-08-27T00:09:04.626-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Project Desc_1'/><title type='text'></title><content type='html'>&lt;strong&gt;Description&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;A &lt;strong&gt;Data Warehouse&lt;/strong&gt; is not an individual repository product. Rather, it is an overall strategy, or process, for building decision support systems and a knowledge-based applications architecture and environment that supports both everyday tactical decision making and long-term business strategizing. The Data Warehouse environment positions a business to utilize an enterprise-wide data store to link information from diverse sources and make the information accessible for a variety of user purposes, most notably, strategic analysis. Business analysts must be able to use the Warehouse for such strategic purposes as trend identification, forecasting, competitive analysis, and targeted market research.&lt;br /&gt;&lt;br /&gt;Data Warehouses and Data Warehouse applications are designed primarily to support executives, senior managers, and business analysts in making complex business decisions. Data Warehouse applications provide the business community with access to accurate, consolidated information from various internal and external sources.&lt;br /&gt;&lt;br /&gt;The primary objective of Data Warehousing is to bring together information from disparate sources and put the information into a format that is conducive to making business decisions. This objective necessitates a set of activities that are far more complex than just collecting data and reporting against it. Data Warehousing requires both business and technical expertise and involves the following activities:&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;- Accurately identifying the business information that must be contained in the Warehouse&lt;br /&gt;&lt;br /&gt;- Identifying and prioritizing subject areas to be included in the Data Warehouse&lt;br /&gt;&lt;br /&gt;- Managing the scope of each subject area which will be implemented into the Warehouse on an iterative basis&lt;br /&gt;&lt;br /&gt;- Developing a scaleable architecture to serve as the Warehouse’s technical and application foundation, and identifying and selecting the hardware/software/middleware components to implement it&lt;br /&gt;&lt;br /&gt;- Extracting, cleansing, aggregating, transforming and validating the data to ensure accuracy and consistency&lt;br /&gt;&lt;br /&gt;- Defining the correct level of summarization to support business decision making&lt;br /&gt;&lt;br /&gt;- Establishing a refresh program that is consistent with business needs, timing and cycles&lt;br /&gt;&lt;br /&gt;- Providing user-friendly, powerful tools at the desktop to access the data in the Warehouse&lt;br /&gt;&lt;br /&gt;- Educating the business community about the realm of possibilities that are available to them through Data Warehousing &lt;br /&gt;&lt;br /&gt;- Establishing a Data Warehouse Help Desk and training users to effectively utilize the desktop tools&lt;br /&gt;&lt;br /&gt;- Establishing processes for maintaining, enhancing, and ensuring the ongoing success and applicability of the Warehouse&lt;br /&gt;&lt;br /&gt;Until the advent of Data Warehouses, enterprise databases were expected to serve multiple purposes, including online transaction processing, batch processing, reporting, and analytical processing. In most cases, the primary focus of computing resources was on satisfying operational needs and requirements. Information reporting and analysis needs were secondary considerations. As the use of PCs, relational databases, 4GL technology and end-user computing grew and changed the complexion of information processing, more and more business users demanded that their needs for information be addressed. Data Warehousing has evolved to meet those needs without disrupting operational processing.&lt;br /&gt;&lt;br /&gt;In the Data Warehouse model, operational databases are not accessed directly to perform information processing. Rather, they act as the source of data for the Data Warehouse, which is the information repository and point of access for information processing. There are sound reasons for separating operational and informational databases, as described below.&lt;br /&gt;&lt;br /&gt;- The users of informational and operational data are different. Users of informational data are generally managers and analysts; users of operational data tend to be clerical, operational and administrative staff.&lt;br /&gt;&lt;br /&gt;- Operational data differs from informational data in context and currency. Informational data contains an historical perspective that is not generally used by operational systems.&lt;br /&gt;&lt;br /&gt;- The technology used for operational processing frequently differs from the technology required to support informational needs.&lt;br /&gt;&lt;br /&gt;- The processing characteristics for the operational environment and the informational environment are fundamentally different.&lt;br /&gt;&lt;br /&gt;The Data Warehouse functions as a Decision Support System (DSS) and an Executive Information System (EIS), meaning that it supports informational and analytical needs by providing integrated and transformed enterprise-wide historical data from which to do management analysis. A variety of sophisticated tools are readily available in the marketplace to provide user-friendly access to the information stored in the Data Warehouse.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-314101000239986612?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/314101000239986612/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=314101000239986612' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/314101000239986612'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/314101000239986612'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/08/description-data-warehouse-is-not.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-1584729927557227706</id><published>2008-08-22T01:03:00.000-07:00</published><updated>2008-08-27T00:00:19.836-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='XML ETL Processing'/><title type='text'></title><content type='html'>&lt;strong&gt;&lt;span style="color:#660000;"&gt;XML ETL Processing&lt;/span&gt; &lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;&lt;span style="color:#3333ff;"&gt;XML&lt;/span&gt;&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;This section introduces XML (Extensible Markup Language) in data processing and explains basic XML concepts which might help understand the use of various ETL tools to process the XML data.&lt;br /&gt;&lt;br /&gt;The sample business case shows how to implement ETL process to process the XML data from a web-based front-end to an OLTP system and to process the operational data periodically.&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;&lt;span style="color:#000099;"&gt;XML data processing concepts&lt;/span&gt;&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;The success and broad use of XML mainly derives from it being platform independent, text based, straightforward, easy to understand, human readable and user defined.&lt;br /&gt;By far the most widely known and encountered form of XML is HTML, a markup language for web pages. In data integration, the XML standard is very often used to communicate between applications.&lt;br /&gt;&lt;br /&gt;Most commercial ETL tools have dedicated components to handle the XML processing.&lt;br /&gt;&lt;br /&gt;Each ETL process for the XML processing task comes with its own challenges and requires its own techniques. Most ETL tools usually provide various approaches to process and transform XML data.&lt;br /&gt;&lt;br /&gt;However, most XML ETL processing tasks tend to yield to one of the two ways the data is interpreted and represented:&lt;br /&gt;&lt;br /&gt;Event XML model is a way to interpret XML as a series of events. Each incoming XML data string is treated as a separate entity - an event. In the XML event approach each of those events is converted to a record in a data flow, where each tag and attribute value is stored in a separate field. This model can be set up quickly, it works with every kind of XML document and the elements can be adjusted as needed. However, the event model does not provide the same level of support for XML Schema, in many cases this model ignores some parts of the document which may result in incomplete data. The event model is a simple way of interpreting XML strings, and one that does not involve great effort to set up.&lt;br /&gt;&lt;br /&gt;Full XML parse processing model - at the time of reading the XML document, an ETL tool component knows the exact document structure and parses it accordingly. It requires additional work effort to be done in the preliminary setup phase, however this approach increases performance and ensures the proper mapping of all values and nullable fields. Another outcome of the investment in this approach is a reduced need for populating downstream components as it is far more error proof.&lt;br /&gt;The Full Object model representation is more sophisticated, the most powerful and useful choice when working with complex XML schemas.&lt;br /&gt;&lt;br /&gt;&lt;span style="color:#000099;"&gt;&lt;strong&gt;Business Scenario&lt;/strong&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;The hypothetical manufacturing company uses two operational systems:&lt;br /&gt;&lt;br /&gt;an OLTP system which allows management of the operational data, including invoices, orders, customers, products, etc. The data is entered manually by the customer service representatives and other employees.&lt;br /&gt;&lt;br /&gt;There is also a web-based application where the customers may place purchase orders online. There is no direct integration between the two systems and when an internet user places a new order, the order details are stored in an XML file on the web hosting server. A separate XML file is created for each new order and it contains data from the internet form.&lt;br /&gt;&lt;br /&gt;These purchase orders are stored in a designated directory on the web server and the whole collection is processed on a daily basis (overnight).&lt;br /&gt;&lt;br /&gt;The aim is to load the purchase orders data from the multiple XML files into an operational table with company's orders.&lt;br /&gt;&lt;br /&gt;&lt;span style="color:#993399;"&gt;&lt;strong&gt;Proposed solution&lt;/strong&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;In rough outline, the ETL process involves the following steps:&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;1. Move XML source files from the web server to the ETL server (to the unprocessed_files folder)&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;2. Process each of the XML files individually:&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;a. Parse XML file using the event XML approach&lt;br /&gt;&lt;br /&gt;b. Pass through additional data to the processing flow, like the file name which contains dates and a timestamp&lt;br /&gt;&lt;br /&gt;c. Validate records (check for example if a customer exists in the database and if the data is correct)&lt;br /&gt;&lt;br /&gt;d. Assign order number to each purchase order&lt;br /&gt;&lt;br /&gt;e. Feed the OLTP table with open orders&lt;br /&gt;&lt;br /&gt;f. Feed the OLTP table with loading history with a corresponding status&lt;br /&gt;&lt;br /&gt;3. Once the entire XML file is processed correctly, it is moved to the processed_files folder&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-1584729927557227706?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/1584729927557227706/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=1584729927557227706' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/1584729927557227706'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/1584729927557227706'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/08/xml-etl-processing-xml-this-section.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-2341626054086504665</id><published>2008-08-21T23:28:00.000-07:00</published><updated>2008-08-21T23:29:39.967-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Real time Stuff'/><title type='text'></title><content type='html'>&lt;span style="color:#990000;"&gt;&lt;strong&gt;Questions on Data Warehousing concept&lt;/strong&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;1. What is Data Warehouse?&lt;br /&gt;&lt;br /&gt;2. What is difference between Data Warehouse and Data Mart ?&lt;br /&gt;&lt;br /&gt;3. What is Star schema?&lt;br /&gt;&lt;br /&gt;4. What is Snow-flake schema?&lt;br /&gt;&lt;br /&gt;5. What is fact and dimension?&lt;br /&gt;&lt;br /&gt;6. What is surrogate key?&lt;br /&gt;&lt;br /&gt;7. What Normlisation ?Explain 3rd Normlised form?&lt;br /&gt;&lt;br /&gt;8. What is the difference between OLTP and OLAP?&lt;br /&gt;&lt;br /&gt;9. Are you involved in data modeling ?If yes which tool/tech you are using?&lt;br /&gt;&lt;br /&gt;10. Which schema modeling techniques you ever used?&lt;br /&gt;&lt;br /&gt;11. What do you mean by summary table?&lt;br /&gt;&lt;br /&gt;12. What Degenerated Dimensions&lt;br /&gt;&lt;br /&gt;13. What is fact less fact?&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;&lt;span style="color:#990000;"&gt;Oracle question based on data warehouse?&lt;/span&gt;&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;1 What is parallel execution&lt;br /&gt;&lt;br /&gt;2 What is Bitmap and B-Tree indexes ? Explain Local Vs Global variables&lt;br /&gt;&lt;br /&gt;3 What is materialised view&lt;br /&gt;&lt;br /&gt;4 What is page size/array size in oracle?&lt;br /&gt;&lt;br /&gt;5 What are integrity constraints ?&lt;br /&gt;&lt;br /&gt;6 How can one tune SQL’s in Oracle?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-2341626054086504665?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/2341626054086504665/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=2341626054086504665' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/2341626054086504665'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/2341626054086504665'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/08/questions-on-data-warehousing-concept-1.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-2321361015107672108</id><published>2008-06-30T13:23:00.000-07:00</published><updated>2008-06-30T14:07:33.719-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Star Schema Modelling'/><title type='text'></title><content type='html'>&lt;p&gt;&lt;span style="font-weight: bold;"&gt;&lt;span style="color: rgb(153, 0, 0);"&gt;Star Schema Modelling&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span lang="EN-GB"&gt;A Dimensional  modelling technique in which a detail &lt;u&gt;fact table&lt;/u&gt; is linked to &lt;a href="http://dwhcareer.blogspot.com/"&gt;&lt;u style="color: rgb(153, 51, 0);"&gt;dimension  tables&lt;/u&gt;.&lt;/a&gt;&lt;span&gt; &lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;span lang="EN-GB"&gt;&lt;o:p&gt;&lt;/o:p&gt;The  data in data warehouses and data marts is accessed by end-users.&lt;span&gt;  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;The information contained in the data warehouse/data mart must be  easy for t&lt;/span&gt;&lt;span lang="EN-GB"&gt;he end-user to use and access.&lt;span&gt; &lt;/span&gt;Denormalized designs are  easier for end-users to use than highly normalized designs, however these  designs are more difficult to design and maintain.&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;br /&gt;&lt;br /&gt;The Star Schema &lt;span style="color: rgb(153, 51, 0);"&gt;diagram &lt;/span&gt;graphically models the  end-user's view (i.e., the denormalized view) of how the information is  accessed.&lt;/span&gt;&lt;br /&gt;&lt;span lang="EN-GB"&gt;&lt;strong&gt;&lt;br /&gt;&lt;span style="color: rgb(153, 51, 0);"&gt;&lt;br /&gt;Components of a Star Schema  Diagram&lt;/span&gt;&lt;br /&gt;&lt;/strong&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;&lt;span style="color: rgb(0, 0, 153);"&gt;The diagram has three main  components:&lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-GB"&gt;&lt;o:p&gt;&lt;br /&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p class="bulleteditem" style="margin: 0in 0in 0pt 9pt;"&gt;&lt;span  lang="EN-GB" style="font-family:Symbol;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;span&gt;·&lt;/span&gt;&lt;span lang="EN-GB"&gt; &lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;Fact  Table and its contents:&lt;/span&gt; metric attributes and the foreign keys necessary to join  to the dimension tables,&lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt; &lt;p class="bulleteditem" style="margin: 0in 0in 0pt 2in;"&gt;&lt;span lang="EN-GB"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt; &lt;p class="bulleteditem" style="margin: 0in 0in 0pt 9pt;"&gt;&lt;span&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt; &lt;p class="bulleteditem" style="margin: 0in 0in 0pt 9pt;"&gt;&lt;span&gt;·&lt;/span&gt;&lt;span lang="EN-GB"&gt; &lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;Dimension Tables and their contents:&lt;/span&gt; reference attributes,  hierarchical attributes, and metric attributes. The dimension tables are highly  denormalized,&lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt; &lt;p class="bulleteditem" style="margin: 0in 0in 0pt 2in;"&gt;&lt;span lang="EN-GB"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt; &lt;p class="bulleteditem" style="margin: 0in 0in 0pt 9pt;"&gt;&lt;span&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt; &lt;p class="bulleteditem" style="margin: 0in 0in 0pt 9pt;"&gt;&lt;span&gt;·&lt;/span&gt;&lt;span lang="EN-GB"&gt; The lines that link the Dimension Tables to the Fact Table.&lt;/span&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt; &lt;p class="MsoNormal"&gt;&lt;br /&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;&lt;br /&gt;&lt;span lang="EN-US"&gt;&lt;strong&gt;&lt;span style="color: rgb(102, 0, 0);"&gt;Star Schema  Model&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;/strong&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;This diagram is a model of  a star schema diagram.&lt;/span&gt;&lt;br /&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://bp2.blogger.com/_Mh4Mr1vTnZg/SGlKR_GUchI/AAAAAAAAAAs/2YHAkAgHrtU/s1600-h/star+scema+model.jpg"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://bp2.blogger.com/_Mh4Mr1vTnZg/SGlKR_GUchI/AAAAAAAAAAs/2YHAkAgHrtU/s400/star+scema+model.jpg" alt="" id="BLOGGER_PHOTO_ID_5217783316141601298" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;strong style="color: rgb(102, 0, 0);"&gt;Star Schema Example&lt;/strong&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;&lt;br /&gt;The following is an  example of a star schema for&lt;span style="color: rgb(153, 51, 0);"&gt; sales&lt;/span&gt;  items.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://bp0.blogger.com/_Mh4Mr1vTnZg/SGlKSFsZllI/AAAAAAAAAA0/8HkrFDZ8K0g/s1600-h/star+schema+example.jpg"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://bp0.blogger.com/_Mh4Mr1vTnZg/SGlKSFsZllI/AAAAAAAAAA0/8HkrFDZ8K0g/s400/star+schema+example.jpg" alt="" id="BLOGGER_PHOTO_ID_5217783317911934546" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;&lt;span style="color: rgb(0, 0, 153);"&gt;NEXT&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-2321361015107672108?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/2321361015107672108/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=2321361015107672108' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/2321361015107672108'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/2321361015107672108'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/06/star-schema-modelling-dimensional.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://bp2.blogger.com/_Mh4Mr1vTnZg/SGlKR_GUchI/AAAAAAAAAAs/2YHAkAgHrtU/s72-c/star+scema+model.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-470406419559004677</id><published>2008-06-30T12:22:00.000-07:00</published><updated>2008-06-30T12:30:11.987-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Data Profiling'/><title type='text'>CONCEPT OF DATA PROFILING</title><content type='html'>&lt;div style="text-align: center;"&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;PREVIOUS&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;h3&gt;&lt;span style="font-size:100%;"&gt;&lt;span class="mw-headline" style="color: rgb(153, 51, 0);"&gt;Data Profiling&lt;/span&gt;&lt;/span&gt;&lt;/h3&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size:100%;"&gt;In 2005 Similarity Systems acquired Evoke  software to add the Evoke Axio profiling tool to the data quality suite and gain  access to a wider customer base in the US and Asia. Similarity was in turn  acquired by Informatica in January of 2006.&lt;br /&gt;&lt;/span&gt;&lt;p&gt;&lt;span style="font-size:100%;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size:100%;"&gt;Informatica renamed Evoke Axio to  Information Data Explorer and offers it as a stand alone product or as a  companion to PowerCenter. &lt;/span&gt;&lt;/p&gt; &lt;p&gt;&lt;span style="font-size:100%;"&gt;Data Explorer provides data profiling functions  to discover and define the content, structure and quality of data.&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size:100%;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size:100%;"&gt;It provides  data mapping capabilities for source to target mapping that can be accessed from  within PowerCenter.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size:100%;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt; &lt;p&gt;&lt;span style="font-size:100%;"&gt;PowerCenter also has a &lt;span class="external text"&gt;Mapping Generation Option&lt;/span&gt; that uses the free  Informatica Data Stencil design tool for Microsoft Visio that generates  PowerCenter mappings from Visio designs or reverse engineers a PowerCenter  mapping into a &lt;span style="color: rgb(0, 0, 153);"&gt;Visio mapping&lt;/span&gt; template.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size:100%;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;NEXT&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-470406419559004677?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/470406419559004677/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=470406419559004677' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/470406419559004677'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/470406419559004677'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/06/concept-of-data-profiling.html' title='CONCEPT OF DATA PROFILING'/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-514087823994043359</id><published>2008-06-30T12:08:00.000-07:00</published><updated>2008-06-30T12:13:42.774-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Data Quality'/><title type='text'>CONCEPT OF DATA QUALITY</title><content type='html'>&lt;div style="text-align: center;"&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;&lt;br /&gt;PREVIOUS&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;h3&gt;&lt;span style="font-size: 100%;"&gt;&lt;span class="mw-headline" style="color: rgb(153, 51, 0);"&gt;Data Quality&lt;/span&gt;&lt;/span&gt;&lt;/h3&gt; &lt;p&gt;&lt;span style="font-size: 100%;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size: 100%;"&gt;In January of 2006 Informatica acquired  Similarity Systems to considerably enhance its data quality and data profile  functionality. The Similarity &lt;span style="color: rgb(0, 0, 153);"&gt;ATHANOR &lt;/span&gt;product has been renamed to  Informatica Data Quality.&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size: 100%;"&gt;It offers data cleansing, data matching and data  quality reporting and monitoring. nformatica Data Quality processes can be added  to a PowerCenter job within the PowerCenter Mapping Designer by selecting a data  quality building block. &lt;/span&gt;&lt;/p&gt; &lt;p&gt;&lt;span style="font-size: 100%;"&gt;&lt;a href="http://dwhcareer.blogspot.com"&gt;Informatica&lt;/a&gt; relies on additional third party  vendors to deliver address validation capabilities to Data Quality. These  products are licensed separately through Informatica. &lt;/span&gt;&lt;/p&gt; &lt;p&gt;&lt;span style="font-size: 100%;"&gt;There is also PowerCenter &lt;span class="external text"&gt;Data Cleanse and Match option&lt;/span&gt; that was built by  Informatica prior to the Similarity acquisition.&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size: 100%;"&gt;It offers data cleansing and  parsing and data matching and is fully integrated with PowerCenter to seemlessly  use the Data Partitioning and Enterprise Grid capabilities for scalable  processing. &lt;/span&gt;&lt;/p&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;&lt;span style="color: rgb(0, 0, 153);font-size:130%;" &gt;&lt;span style="font-weight: bold;"&gt;NEXT&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-514087823994043359?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/514087823994043359/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=514087823994043359' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/514087823994043359'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/514087823994043359'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/06/concept-of-data-quality.html' title='CONCEPT OF DATA QUALITY'/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-7185939435720164285</id><published>2008-06-30T11:52:00.000-07:00</published><updated>2008-06-30T12:05:42.451-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Metadata Management'/><title type='text'></title><content type='html'>&lt;h3&gt;&lt;span style="font-size: 100%;"&gt;&lt;span class="mw-headline" style="color: rgb(153, 51, 0);"&gt;Metadata Management&lt;/span&gt;&lt;/span&gt;&lt;/h3&gt; &lt;p&gt;&lt;span style="font-size: 100%;"&gt;Informatica has some technical &lt;a href="http://dwhcareer.blogspot.com"&gt;&lt;span style="color: rgb(0, 0, 153);"&gt;metadata&lt;/span&gt;&lt;/a&gt; functions and reports in the  PowerCenter product to support ETL development. In addition Informatica has the  Metadata Manager and the Metadata Exchange Option. &lt;/span&gt;&lt;/p&gt; &lt;p&gt;&lt;span style="font-size: 100%;"&gt;In 2003 Informatica launched the SuperGlue  metadata management solution for additional metadata capabilities. It is a web  based solution for assembling, visualizing and analyzing metadata from various  enterprise systems.&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size: 100%;"&gt;In 2005 Informatica launched PowerCenter Advanced Edition  that came bundled with SuperGlue. With the release of PowerCenter 8 Advanced  Edition the SuperGlue name is gone and the product is now known as &lt;span class="external text"&gt;Metadata Manager&lt;/span&gt; and is only available in this  product bundle. &lt;/span&gt;&lt;/p&gt; &lt;p&gt;&lt;span style="font-size: 100%;"&gt;Informatica offers a PowerCenter option called  &lt;span class="external text"&gt;Metadata Exchange&lt;/span&gt; that uses a wizard driven  interface to import source definitions into PowerCenter. It provides more  descriptive metadata and some metadata synchronization functions for when source  metadata definitions change.&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size: 100%;"&gt;It offers a combination of bi-directional and  uni-directional exchange of metadata to products such as ERwin, Oracle Designer,  Cognos Impromptu and Business Objects Designer.&lt;/span&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-7185939435720164285?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/7185939435720164285/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=7185939435720164285' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/7185939435720164285'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/7185939435720164285'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/06/metadata-management-informatica-has.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-1291012068371639682</id><published>2008-06-30T11:31:00.000-07:00</published><updated>2008-06-30T11:36:47.994-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Concepts-7'/><title type='text'></title><content type='html'>&lt;div style="text-align: center;"&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;&lt;br /&gt;PREVIOUS&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;&lt;br /&gt;What are mapping parameters and mapping variables?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Mapping parameter &lt;a href="http://dwhcareer.blogspot.com/"&gt;represents &lt;/a&gt;a constant value that you can define before running a session.A mapping parameter retains the same value throughout the entire session.&lt;br /&gt;&lt;br /&gt;When you use the mapping parameter , you declare and use the parameter in a mapping or mapplet.Then define the value of parameter in a parameter file for the session.&lt;br /&gt;&lt;br /&gt;Unlike a mapping parameter, a mapping variable represents a value that can change through out the session. The informatica server save the value of mapping variable to the repository at the end of session run and uses that value next time you run the session.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;&lt;br /&gt;Can you use the mapping parameters or variables created in one mapping into another mapping?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;NO, we can use mapping parameters or variables in any transformation of the same mapping or mapplet in which have crated mapping parameters or variables.&lt;br /&gt;&lt;br /&gt;Can you are the mapping parameters or variables created in one mapping into any other result transformation.&lt;br /&gt;&lt;br /&gt;Yes because the reusable transformation is not contained with any mapplet or mapping.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;&lt;br /&gt;How the informatica server sorts the string values in rank transformation?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;When the informatica server runs in the ASCII data movement mode it sorts session data using binary sort order.If you configures the session to use a binary sort order, the informatica server calculates the binary value of each string and returns the specified number of rows with the highest binary values for the string.&lt;o:p&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/o:p&gt;&lt;span style="color: rgb(102, 0, 0); font-weight: bold;"&gt;What is the rank index in rank transformation?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;The designer automatically creates a &lt;a href="http://dwhcareer.blogspot.com/"&gt;&lt;span style="font-weight: bold;"&gt;RANKINDEX&lt;/span&gt;&lt;/a&gt; port for each Rank transformation. The informatica server uses the Rank Index port to store the ranking position for each record in a group.For example, if you create a Rank transformation that ranks the top 5 sales persons for each quarter, the rank index number the salespeople from 1 to 5. &lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;&lt;br /&gt;NEXT&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-1291012068371639682?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/1291012068371639682/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=1291012068371639682' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/1291012068371639682'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/1291012068371639682'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/06/previous-what-are-mapping-parameters.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-8097295239094113672</id><published>2008-06-30T11:22:00.000-07:00</published><updated>2008-06-30T11:30:45.598-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Concepts-6'/><title type='text'></title><content type='html'>&lt;div style="text-align: center;"&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;&lt;br /&gt;PREVIOUS&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-weight: bold;"&gt;&lt;/span&gt;&lt;/div&gt;&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="color: rgb(102, 0, 0); font-weight: bold;"&gt;Difference between the source filter and filter?&lt;/span&gt;&lt;span style=""&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;Source filter is filtering the data only relational sources. Where as filter transformation filter the data any type of source.&lt;o:p&gt;&lt;br /&gt;&lt;br /&gt;&lt;/o:p&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;&lt;br /&gt;what is a tracing level?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Amount of information sent to log file.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;What are the types of tracing levels?&lt;/span&gt;&lt;st1:city st="on"&gt;&lt;st1:place st="on"&gt;&lt;br /&gt;&lt;br /&gt;Normal&lt;/st1:place&gt;&lt;/st1:City&gt;,Terse,verbose data,verbose intitialization.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;Expalin sequence generator transformation?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Sequence generator transformation is used to generate sequence of numbers. It is substitute to the Primary key. There should be atleast one current value to generate Next value.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(102, 0, 0);font-size:100%;" &gt;&lt;span style="font-weight: bold;"&gt;Can you connect multiple ports from one group to multiple transformations?&lt;/span&gt;&lt;/span&gt;&lt;span style=""&gt;&lt;br /&gt;&lt;br /&gt;   &lt;/span&gt;Yes&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;&lt;br /&gt;Can you connect more than one group to the same target or transformation?&lt;/span&gt;&lt;span style=""&gt;&lt;br /&gt;&lt;br /&gt; &lt;/span&gt;NO&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(102, 0, 0); font-weight: bold;"&gt;&lt;br /&gt;What is a reusable transformation?&lt;/span&gt;&lt;span style=""&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;Reusable transformation can be a single transformation.This transformation can be used in multiple mappings.when you need to incorporate this transformation into mapping you add an instance of it to mapping.&lt;br /&gt;&lt;br /&gt;Later if you change the definition of the transformation, all instances of it inherit the changes.Since the instance of reusable transformation is a pointer to that transformation.U can change the transformation in the transformation developer, its instance automatically reflect these changes. This feature can save U great deal of work.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;What are the methods for creating reusable transformation?&lt;/span&gt;&lt;span style=""&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;Two methods&lt;br /&gt;&lt;br /&gt;First method is through Transformation Developer. and Second is by Mapplet.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Design it in the transformation developer.&lt;br /&gt;&lt;br /&gt;Promote a standard transformation from the mapping designer.After you add a transformation to the mapping, you can promote it to status of reusable transformation.&lt;br /&gt;&lt;br /&gt;Once you promote a standard transformation to reusable status, you can demote it to a standard transformation at any time.&lt;br /&gt;&lt;br /&gt;If u change the properties of a reusable transformation in mapping , you can revert it to the original reusable transformation properties by clicking the revert. &lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;&lt;span style="color: rgb(0, 0, 153);font-size:130%;" &gt;&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;NEXT&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-8097295239094113672?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/8097295239094113672/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=8097295239094113672' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/8097295239094113672'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/8097295239094113672'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/06/previous-difference-between-source.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-2100948925332370591</id><published>2008-06-30T11:16:00.000-07:00</published><updated>2008-06-30T11:20:55.567-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Concepts-5'/><title type='text'></title><content type='html'>&lt;div style="text-align: center;"&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;&lt;br /&gt;PREVIOUS&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;br /&gt;&lt;span style="color: rgb(102, 0, 0); font-weight: bold;"&gt;What is default join that source qualifier provides?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Inner equi join.&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(102, 0, 0); font-weight: bold;"&gt;&lt;br /&gt;What are the difference between joiner transformation and source qualifier transformation?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;You can join heterogeneous data sources in joiner transformation, which we cannot achive in source qualifier transformation.&lt;br /&gt;&lt;br /&gt;You need matching keys to join two relational sources in source qualifier transformation.where you doesn’t need matching keys to join two sources.&lt;br /&gt;&lt;br /&gt;Two relational sources should come from same data source in source qualifier.You can join relational sources, which are coming from different sources in source qualifier.You can join relational sources which are coming from different sources also.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;&lt;br /&gt;What is update strategy transformation?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Whenever you create the target table whether you are store the historical data or current transaction data in to target table.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;&lt;br /&gt;Describe two levels in which update strategy transformation sets?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;what is default source option for update strategy transformation?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Data driven.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;&lt;br /&gt;What is data driven?&lt;/span&gt;&lt;span style=""&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;The informata server follows instructions coded into update strategy transformations with in the session mapping determine how to flag records for insert,update,delete or reject if u do not choose data driven option setting , the informatica server ignores all update strategy transformations in the mapping.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;What are the options in the trarget session of update strategy transformation?&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style=""&gt;  &lt;/span&gt;Insert&lt;span style=""&gt;&lt;br /&gt;  &lt;/span&gt;Delete&lt;span style=""&gt;&lt;br /&gt;  &lt;/span&gt;Update&lt;span style=""&gt;&lt;br /&gt;  &lt;/span&gt;Update as update&lt;span style=""&gt;&lt;br /&gt; &lt;/span&gt; Update as insert&lt;br /&gt;  Update else insert&lt;span style=""&gt;&lt;br /&gt; &lt;/span&gt; Truncate table. &lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;NEXT&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-2100948925332370591?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/2100948925332370591/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=2100948925332370591' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/2100948925332370591'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/2100948925332370591'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/06/previous-what-is-default-join-that.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-3277023531889084685</id><published>2008-06-30T11:08:00.000-07:00</published><updated>2008-06-30T11:13:03.641-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Concepts-4'/><title type='text'></title><content type='html'>&lt;div style="text-align: center;"&gt;&lt;span style="color: rgb(0, 0, 153);font-size:100%;" &gt;&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;PREVIOUS&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;span style="font-size:100%;"&gt;&lt;br /&gt;&lt;span style="color: rgb(102, 0, 0); font-weight: bold;"&gt;&lt;br /&gt;What is a slowly growing dimension?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Slowly growing dimensions are dimensional data,there dimensions increasing dimension data with out update existing dimensions.That means appending new data to existing dimensions.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;&lt;br /&gt;What is a slowly changing dimension?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Slowly changing dimension are dimension data,these dimensions increasing dimensions data with update existing dimensions.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;Type1: &lt;/span&gt;Rows containing changes to existing dimensional are update in the target by overwriting the existing dimension.In the Type1 Dimension mapping, all rows contain current dimension data.&lt;/span&gt;&lt;span style=";font-size:100%;" &gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-size:100%;"&gt;Use the type1 dimension mapping to update a slowly changing dimension table when you do not need to keep any previous versions of dimensions in the table.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;Type2:&lt;/span&gt; The Type2 Dimension data mapping inserts both new and changed dimensions into the target.Changes are tracked in the target table by versioning the primary key and creating a version number for each dimension in the table.&lt;/span&gt;&lt;span style=";font-size:100%;" &gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-size:100%;"&gt;Use the Type2 Dimension/version data mapping to update a slowly changing dimension when you want to keep a full history of dimension data in the table.version numbers and versioned primary keys track the order of changes to each dimension.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;&lt;br /&gt;Type3:&lt;/span&gt; The type 3 dimension mapping filters source rows based on user-defined comparisions and inserts only those found to be new dimensions to the target.Rows containing changes to existing dimensions are updated in the target. When updating an existing dimension the informatica server saves existing data in different columns of the same row and replaces the existing data with the updates.&lt;br /&gt;&lt;br /&gt;When you use for dynamic cache.&lt;br /&gt;&lt;br /&gt;Your target table is also look up table then you go for dynamic cache .In dynamic cache multiple matches return an error.use only = operator.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;&lt;br /&gt;What is lookup override?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Override the default SQL statement.You can join multiple sources use lookup override.By default informatica server add the order by clause.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;&lt;br /&gt;We can pass the null value in lookup transformation?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Lookup transformation returns the null value or equal to null value.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;&lt;br /&gt;what is the target load order? &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style=";font-family:&amp;quot;;font-size:100%;"  &gt;You specify the target load order based on source qualifiers in a mapping.if u have the multiple source qualifiers connected to the multiple targets you can designate the order in which informatica server loads data into the targets.&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div style="text-align: center;"&gt;&lt;span style=";font-family:&amp;quot;;font-size:130%;"  &gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;NEXT&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-3277023531889084685?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/3277023531889084685/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=3277023531889084685' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/3277023531889084685'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/3277023531889084685'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/06/previous-what-is-slowly-growing.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-4906802514191567523</id><published>2008-06-30T09:32:00.000-07:00</published><updated>2008-06-30T09:35:34.797-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Concepts-3'/><title type='text'></title><content type='html'>&lt;div style="text-align: center;"&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;&lt;br /&gt;PREVIOUS&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;How many types of approaches in DHW?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Two approaches: Top-down(Bill-Inmon approach), Bottom-up(Ralph Kimball).&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;Explain Star Schema?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Star Schema consists of one or more fact table and one or more dimension tables that are  related to foreign keys.&lt;br /&gt;&lt;br /&gt;Dimension tables are De-normalized, Fact table-normalized.&lt;br /&gt;&lt;br /&gt;Advantages: Less database space &amp;amp;  Simplify queries.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;&lt;br /&gt;Explain Snowflake schema?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Snow flake schema is a normalize dimensions to eliminate the redundancy.The dimension data has been grouped into one large table. Both dimension and fact tables normalized.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;What is confirm dimension?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;If both data marts use same type of dimension that is called confirm dimension.If you have same type of dimension can be used in multiple fact that is called confirm dimension.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;NEXT&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-4906802514191567523?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/4906802514191567523/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=4906802514191567523' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/4906802514191567523'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/4906802514191567523'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/06/previous-how-many-types-of-approaches.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-6672037293499127267</id><published>2008-06-30T09:17:00.000-07:00</published><updated>2008-06-30T09:23:06.676-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Concepts-2'/><title type='text'></title><content type='html'>&lt;div style="text-align: center;"&gt;&lt;span style="color: rgb(0, 0, 153); font-weight: bold;"&gt;PREVIOUS&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;Difference between &lt;/span&gt;&lt;st1:place style="font-weight: bold; color: rgb(102, 0, 0);" st="on"&gt;&lt;st1:placename st="on"&gt;Power&lt;/st1:PlaceName&gt; &lt;st1:placetype st="on"&gt;Center&lt;/st1:PlaceType&gt;&lt;/st1:place&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt; and Power Mart?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Power center receive all product functionality including ability to multiple register servers and metadata across the repository and partition data.&lt;br /&gt;&lt;br /&gt;One repository multiple informatica servers. Power mart received all features except multiple register servers and partition data.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;What is a staging area?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Staging area is a temporary storage area used for transaction, integrated and rather than transaction processing.&lt;br /&gt;&lt;br /&gt;&lt;span style=""&gt;&lt;/span&gt;&lt;span style=""&gt;&lt;/span&gt;When ever your data put in data warehouse you need to clean and process your data.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;&lt;br /&gt;Explain Additive, Semi-additive, Non-additive facts?&lt;/span&gt;&lt;span style=""&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;Additive fact:&lt;/span&gt; Additive Fact can be aggregated by simple arithmetical additions.&lt;span style=""&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;Semi-Additive fact:&lt;/span&gt; semi additive fact can be aggregated simple arithmetical&lt;span style=""&gt;.&lt;/span&gt; Additions along with some other dimensions.&lt;span style=""&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;Non-additive fact:&lt;/span&gt; Non-additive fact can’t be added at all.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;&lt;br /&gt;What is a Fact less Fact and example?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style=""&gt;&lt;/span&gt;Fact table which has no measures.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;&lt;br /&gt;Explain Surrogate Key?&lt;/span&gt;&lt;span style=""&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;Surrogate Key is a series of sequential numbers assigned to be a primary key for the table. &lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;NEXT&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-6672037293499127267?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/6672037293499127267/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=6672037293499127267' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/6672037293499127267'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/6672037293499127267'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/06/previous-difference-between-power.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-3410168515948242982</id><published>2008-06-30T08:00:00.000-07:00</published><updated>2008-06-30T08:56:15.132-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Concepts-1'/><title type='text'></title><content type='html'>&lt;div style="text-align: center;"&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;&lt;br /&gt;PREVIOUS&lt;/span&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;What is Data warehouse?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Data warehouse is relational database used for query analysis and reporting. By definition data warehouse is Subject-oriented, Integrated, Non-volatile, Time variant.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;Subject oriented :&lt;/span&gt; Data warehouse is maintained particular subject.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;Integrated :&lt;/span&gt;&lt;span style="color: rgb(0, 0, 153);"&gt; &lt;/span&gt;Data collected from multiple sources integrated into a user readable&lt;span style=""&gt;&lt;/span&gt;&lt;span style=""&gt;&lt;/span&gt; unique &lt;span style=""&gt;  &lt;/span&gt;format.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;Non volatile :&lt;/span&gt; Maintain Historical date.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;Time variant  :&lt;/span&gt; data display the weekly, monthly, yearly.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;&lt;br /&gt;What is Data mart?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;A subset of data warehouse is called Data mart.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;&lt;br /&gt;Difference between Data warehouse and Data mart?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Data warehouse is maintaining the total organization of data. Multiple data marts used in data warehouse. where as data mart is maintained only particular subject.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;&lt;br /&gt;Difference between OLTP and OLAP?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;OLTP is Online Transaction Processing. This is maintained current transactional data. That means insert, update and delete must be fast.&lt;br /&gt;&lt;br /&gt;OLAP is  Online Analytical Processing which will be used to analyze the business and will be helpful in taking decisions in betterment of the business.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(102, 0, 0);"&gt;Explain ODS?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Operational data store is a part of data warehouse. This is maintained only current transactional data.&lt;br /&gt;&lt;br /&gt;ODS is subject oriented, integrated, volatile, current data.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;&lt;span style="font-weight: bold; color: rgb(0, 0, 153);"&gt;NEXT&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-3410168515948242982?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/3410168515948242982/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=3410168515948242982' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/3410168515948242982'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/3410168515948242982'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/06/previous-what-is-data-warehouse-data.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-7558876239318657773</id><published>2008-06-10T06:18:00.001-07:00</published><updated>2008-06-10T06:21:05.392-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='ETL Process Flow'/><title type='text'></title><content type='html'>&lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;ETL Concepts&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Extraction, transformation, and loading. ETL refers to the methods involved in accessing and manipulating source data and loading it into target database.&lt;br /&gt;&lt;br /&gt;&lt;/p&gt; &lt;p&gt;The first step in ETL process is mapping the data between source systems and target database(data warehouse or data mart). The second step is cleansing of source data in staging area. The third step is transforming cleansed source data and then loading into the target system.&lt;/p&gt;  &lt;p&gt;Note that ETT (extraction, transformation, transportation) and ETM (extraction, transformation, move) are sometimes used instead of ETL. &lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;Glossary of ETL &lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;Source System&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;A database, application, file, or other storage facility from which the data  in a data warehouse is derived. &lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;Mapping&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;The definition of the relationship and data flow between source and target  objects. &lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;Metadata&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;Data that describes data and other structures, such as objects, business rules, and processes. For example, the schema design of a data warehouse is typically stored in a repository as metadata, which is used to generate scripts used to build and populate the data warehouse. A repository contains metadata.&lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;Staging Area&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;A place where data is processed before entering the warehouse. &lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;Cleansing&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;The process of resolving inconsistencies and fixing the anomalies in source  data, typically as part of the ETL process. &lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;Transformation&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;The process of manipulating data. Any manipulation beyond copying is a transformation. Examples include cleansing, aggregating, and integrating data from multiple sources. &lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;Transportation&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;The process of moving copied or transformed data from a source to a data  warehouse. &lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;Target System&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;A database, application, file, or other storage facility to which the  "transformed source data" is loaded in a data warehouse.&lt;/p&gt;  &lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(153, 51, 0); font-style: italic;"&gt;Figure 1.12  : Sample ETL Process Flow&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://bp1.blogger.com/_Mh4Mr1vTnZg/SE5_cNQ1icI/AAAAAAAAAAM/GOLdjKY0X9w/s1600-h/etl.png"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://bp1.blogger.com/_Mh4Mr1vTnZg/SE5_cNQ1icI/AAAAAAAAAAM/GOLdjKY0X9w/s400/etl.png" alt="" id="BLOGGER_PHOTO_ID_5210241941487585730" border="0" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-7558876239318657773?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/7558876239318657773/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=7558876239318657773' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/7558876239318657773'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/7558876239318657773'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/06/etl-concepts-extraction-transformation.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://bp1.blogger.com/_Mh4Mr1vTnZg/SE5_cNQ1icI/AAAAAAAAAAM/GOLdjKY0X9w/s72-c/etl.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-9084485301789641671</id><published>2008-06-10T06:16:00.000-07:00</published><updated>2008-06-10T06:17:39.740-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='What to Learn? : ETL Tools'/><title type='text'></title><content type='html'>&lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;ETL Tools: What to Learn?&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;With the help of ETL tools, we can create powerful target Data Warehouses without much difficulty. Following are the various options that we have to know and learn in order to use ETL tools.&lt;br /&gt;&lt;br /&gt;&lt;/p&gt; &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;Software:&lt;br /&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color:red;"&gt;» &lt;/span&gt;How to install ETL tool on server/client?  &lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;Working with an ETL  Tool:&lt;br /&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color:red;"&gt;» &lt;/span&gt;How to work with various options like designer, mapping, workflow,  scheduling etc.,?&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color:red;"&gt;» &lt;/span&gt;How to work with sources like DBMS,  relational source databases, files, ERPs etc., and&lt;br /&gt;import the source  definitions?&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color:red;"&gt;» &lt;/span&gt;How to import data from data modeling  tools, applications etc.,?&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color:red;"&gt;» &lt;/span&gt;How to work with targets like &lt;span style="color: rgb(153, 0, 0);"&gt;DBMS&lt;/span&gt;, relational source databases, files,  ERPs etc., and import the source definitions?&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color:red;"&gt;» &lt;/span&gt;How to create target definitions?&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color:red;"&gt;» &lt;/span&gt;How to create mappings between source  definitions and target definitions?&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color:red;"&gt;» &lt;/span&gt;How to create transformations?&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color:red;"&gt;» &lt;/span&gt;How to cleanse the source data?&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color:red;"&gt;» &lt;/span&gt;How to create a dimension, slowly changing  dimensions, cube etc.,?&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color:red;"&gt;» &lt;/span&gt;How to create and monitor &lt;span style="color: rgb(153, 0, 0);"&gt;workflows&lt;/span&gt;?&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color:red;"&gt;» &lt;/span&gt;How to configure, monitor and run  debugger?&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color:red;"&gt;» &lt;/span&gt;How to view and generate metadata reports?  &lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-9084485301789641671?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/9084485301789641671/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=9084485301789641671' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/9084485301789641671'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/9084485301789641671'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/06/etl-tools-what-to-learn-with-help-of.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-6038660086515370674</id><published>2008-06-07T00:22:00.000-07:00</published><updated>2008-06-07T00:26:18.532-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='OLAP - Examples'/><title type='text'></title><content type='html'>&lt;p&gt;&lt;span style="font-weight: bold; color: rgb(153, 0, 0);"&gt;OLAP - Examples:&lt;/span&gt;  &lt;/p&gt; &lt;p&gt;&lt;br /&gt;Topmost executives of an organization are really interested in aggregated facts or numbers to take decisions rather than querying several databases (that are normalized) to get the data and do the comparison by themselves.&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;OLAP tools visualize the data in an understandable format, like in the form of Scorecards and Dashboards with Key Performance Indicators enabling managers to &lt;span style="color: rgb(0, 102, 0);"&gt;&lt;a href="http://dwhcareer.blogspot.com"&gt;monito&lt;/a&gt;r&lt;/span&gt; and take immediate  actions.&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;In todays business life, OLAP plays a vital role by assisting decision makers in the field of banking and finance, hospitals, insurance, manufacturing, pharmaceuticals etc., to measure facts across geography, demography, product, and sales.&lt;/p&gt;  &lt;p&gt;OLAP can be performed in data warehouses that undergo frequent updates and that do not. Following are some of the examples to show how &lt;a href="http://dwhcareer.blogspot.com"&gt;&lt;span style="color: rgb(0, 102, 0);"&gt;OLAP&lt;/span&gt;&lt;/a&gt; solves complex queries involving  facts to be measured across company’s best-interested dimensions.&lt;/p&gt;  &lt;p&gt;&lt;br /&gt;&lt;/p&gt;  &lt;ul&gt; &lt;li&gt;Comparison of sales (fact) of a product (dimension) over years (dimension)  in the same region (dimension).&lt;/li&gt; &lt;/ul&gt;  &lt;ul&gt; &lt;li&gt;How may members (fact) have opened a savings account (dimension), in USA  branch (dimension), over a period (dimension) ?&lt;/li&gt; &lt;/ul&gt;  &lt;ul&gt; &lt;li&gt;How many mortgage loans (fact) have been approved in fixed mortgage (dimension) or Adjustable Rate Mortgage (dimension) in New York City (dimension), over a period (dimension) ?&lt;/li&gt; &lt;/ul&gt;  &lt;ul&gt; &lt;li&gt;What is the total sales value (fact) of a particular product (dimension) in a particular grocery store (dimension), over a period (dimension) ?&lt;/li&gt; &lt;/ul&gt;  &lt;ul&gt; &lt;li&gt;What is the amount spent (fact) for a particular product promotion (dimension) in a particular branch (dimension) or in a particular city (dimension), over a period (dimension) ? &lt;/li&gt; &lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-6038660086515370674?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/6038660086515370674/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=6038660086515370674' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/6038660086515370674'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/6038660086515370674'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/06/olap-examples-topmost-executives-of.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-1174011174197542293</id><published>2008-06-07T00:20:00.000-07:00</published><updated>2008-06-07T00:21:50.791-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='OLAP n its Hybrids'/><title type='text'></title><content type='html'>&lt;p&gt;&lt;span style="font-size:100%;"&gt;&lt;b&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;OLAP  &amp;amp; its Hybrids&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="font-size:100%;"&gt;OLAP, an acronym for &lt;b&gt;Online Analytical  Processing&lt;/b&gt; is an approach that helps organization to take advantages of DATA. Popular OLAP tools are Cognos, Business Objects, Micro Strategy etc. OLAP cubes provide the insight into data and helps the topmost executives of an organization to take decisions in an efficient manner.&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="font-size:100%;"&gt;Technically, OLAP cube allows one to analyze data across multiple dimensions by providing multidimensional view of aggregated, grouped data. With OLAP reports, the major categories like fiscal periods, sales region, products, employee, promotion related to the product can be ANALYZED very efficiently, effectively and responsively. OLAP applications include sales and customer analysis, budgeting, marketing analysis, production analysis, profitability analysis and forecasting etc.&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="font-size:100%;"&gt;&lt;br /&gt;&lt;b&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;ROLAP&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="font-size:100%;"&gt;ROLAP stands for Relational Online Analytical Process that provides multidimensional analysis of data, stored in a Relational database(RDBMS). &lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="font-size:100%;"&gt;&lt;b&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;br /&gt;MOLAP&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="font-size:100%;"&gt;MOLAP(Multidimensional OLAP), provides the  analysis of data stored in a multi-dimensional data cube.&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="font-size:100%;"&gt;&lt;b&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;br /&gt;HOLAP&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="font-size:100%;"&gt;HOLAP(Hybrid OLAP) a combination of both ROLAP and MOLAP can provide multidimensional analysis simultaneously of data stored in a multidimensional database and in a relational database(RDBMS). &lt;/span&gt;&lt;/p&gt; &lt;span style="color: rgb(0, 0, 153);font-size:100%;" &gt;&lt;b&gt;&lt;span style=""&gt;&lt;br /&gt;DOLAP&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;span style=""&gt;&lt;br /&gt;&lt;br /&gt;DOLAP(Desktop OLAP or Database OLAP)provide multidimensional analysis locally in the client machine on the data collected from relational or multidimensional database servers.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-1174011174197542293?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/1174011174197542293/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=1174011174197542293' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/1174011174197542293'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/1174011174197542293'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/06/olap-its-hybrids-olap-acronym-for.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-412914279341286606</id><published>2008-06-07T00:18:00.000-07:00</published><updated>2008-06-07T00:19:53.689-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='OLAP Analysis'/><title type='text'></title><content type='html'>&lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;OLAP Analysis&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;Imagine an organization that manufactures and sells goods in several States of USA which employs hundreds of employees in its manufacturing, sales and marketing division etc. In order to manufacture and sell this product in profitable manner, the executives need to analyse(OLAP analysis) the data on the product and think about various possibilities and causes for a particular event like loss in sales, less productivity or increase in sales over a particular period of the year.&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="font-weight: bold; color: rgb(153, 51, 0);"&gt;&lt;br /&gt;During the OLAP analysis, the  top executives may seek answers for the following:&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;&lt;br /&gt;1. Number of products manufactured.&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;2. Number of products manufactured in a location.&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;3. Number of products manufactured on time basis within a location.&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;4. Number of products manufactured in the current year when compared to the  previous year.&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;5. Sales Dollar value for a particular product.&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;6. Sales Dollar value for a product in a location.&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;7. Sales Dollar value for a product in a year within a location.&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;8. Sales Dollar value for a product in a year within a location sold or  serviced by an employee.&lt;br /&gt;OLAP tools help executives in finding out the answers, not only to the above mentioned measures, even for the very complex queries by allowing them to slice and dice, drill down from higher level to lower level summarized data, rank, sort, etc. &lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;Example of OLAP Analysis  Report&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt; &lt;table class="lookup4" cellpadding="2" cellspacing="1"&gt; &lt;tbody&gt; &lt;tr&gt; &lt;td class="header4"&gt;Time Dimension Id&lt;/td&gt; &lt;td class="header4"&gt;Location Dimension Id&lt;/td&gt; &lt;td class="header4"&gt;Product Dimension Id&lt;/td&gt; &lt;td class="header4"&gt;Organization Dimension Id&lt;/td&gt; &lt;td class="header4"&gt;Sales Dollar&lt;/td&gt; &lt;td class="header4"&gt;DateTimeStamp&lt;/td&gt;&lt;/tr&gt; &lt;tr&gt; &lt;td&gt;1&lt;/td&gt; &lt;td&gt;1&lt;/td&gt; &lt;td&gt;100001&lt;/td&gt; &lt;td&gt;1&lt;/td&gt; &lt;td&gt;1000&lt;/td&gt; &lt;td&gt;1/1/2005 11:23:31 AM&lt;/td&gt;&lt;/tr&gt; &lt;tr&gt; &lt;td&gt;3&lt;/td&gt; &lt;td&gt;1&lt;/td&gt; &lt;td&gt;100001&lt;/td&gt; &lt;td&gt;1&lt;/td&gt; &lt;td&gt;750&lt;/td&gt; &lt;td&gt;1/1/2005 11:23:31 AM&lt;/td&gt;&lt;/tr&gt; &lt;tr&gt; &lt;td&gt;1&lt;/td&gt; &lt;td&gt;1&lt;/td&gt; &lt;td&gt;100001&lt;/td&gt; &lt;td&gt;2&lt;/td&gt; &lt;td&gt;1000&lt;/td&gt; &lt;td&gt;1/1/2005 11:23:31 AM&lt;/td&gt;&lt;/tr&gt; &lt;tr&gt; &lt;td&gt;3&lt;/td&gt; &lt;td&gt;1&lt;/td&gt; &lt;td&gt;100001&lt;/td&gt; &lt;td&gt;2&lt;/td&gt; &lt;td&gt;750&lt;/td&gt; &lt;td&gt;1/1/2005 11:23:31 AM&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;/p&gt;  &lt;p&gt;&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;In the above example of OLAP analysis, data can be sliced and diced, drilled up and drilled down for various hierarchies like time dimension, location dimension, product dimension, and organization dimension .&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;This would provide the topmost executives to take a decision about the  product performance in a location/time/organization.&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;In OLAP reports, Trend analysis can be also made by comparing the sales value of a particular product over several years or quarters.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-412914279341286606?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/412914279341286606/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=412914279341286606' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/412914279341286606'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/412914279341286606'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/06/olap-analysis-imagine-organization-that.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-8741567373586095383</id><published>2008-06-07T00:17:00.000-07:00</published><updated>2008-06-07T00:18:18.632-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='OLAP Database'/><title type='text'></title><content type='html'>&lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;OLAP Database - Multidimensional&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;This is a type of database that is optimized for data warehouse, data mart and online analytical processing (OLAP) applications. The main advantage of this database is query performance.&lt;br /&gt;&lt;br /&gt;&lt;/p&gt; &lt;p&gt;Relational databases make it easy to work with individual records, whereas multidimensional databases are designed for analyzing large groups of records. Relational database is typically accessed using a Structured Query Language (SQL) query.&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;A multidimensional database allows a user to ask questions like "How many mortgages have been sold in New Jersey city" and "How many credit cards have been purchased in a particular county?". &lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p style="font-weight: bold;"&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;Popular  Multidimensional Databases&lt;/span&gt;&lt;/p&gt;  &lt;p&gt;&lt;br /&gt;&lt;/p&gt;  &lt;p&gt;&lt;span style="color: rgb(0, 51, 153);"&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt; &lt;table cellpadding="3" cellspacing="1"&gt; &lt;tbody&gt; &lt;tr&gt; &lt;td class="special"&gt;Database Name&lt;/td&gt; &lt;td class="special"&gt;Company Name&lt;/td&gt;&lt;/tr&gt; &lt;tr&gt; &lt;td&gt;Crystal Holos&lt;/td&gt; &lt;td&gt;Business Objects&lt;/td&gt;&lt;/tr&gt; &lt;tr&gt; &lt;td&gt;Hyperion Essbase&lt;/td&gt; &lt;td&gt;Hyperion&lt;/td&gt;&lt;/tr&gt; &lt;tr&gt; &lt;td&gt;Oracle Express&lt;/td&gt; &lt;td&gt;Oracle Corporation&lt;/td&gt;&lt;/tr&gt; &lt;tr&gt; &lt;td&gt;Oracle OLAP Option&lt;/td&gt; &lt;td&gt;Oracle Corporation&lt;/td&gt;&lt;/tr&gt; &lt;tr&gt; &lt;td&gt;AWMicrosoft Analysis Services&lt;/td&gt; &lt;td&gt;Microsoft&lt;/td&gt;&lt;/tr&gt; &lt;tr&gt; &lt;td&gt;PowerPlay Enterprise&lt;/td&gt; &lt;td&gt;Cognos&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;br /&gt;&lt;/p&gt; &lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-8741567373586095383?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/8741567373586095383/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=8741567373586095383' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/8741567373586095383'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/8741567373586095383'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/06/olap-database-multidimensional-this-is.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-1305275264264675951.post-5930306773349194016</id><published>2008-05-20T23:27:00.000-07:00</published><updated>2008-05-20T23:40:31.964-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='What are ETL Tools?'/><title type='text'></title><content type='html'>&lt;strong&gt;&lt;span style="color:#660000;"&gt;What are ETL Tools? &lt;/span&gt;&lt;/strong&gt;&lt;br /&gt;&lt;strong&gt;&lt;span style="color:#660000;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/strong&gt;ETL Tools are meant to extract, transform and load the data into Data Warehouse for decision making. Before the evolution of ETL Tools, the above mentioned ETL process was done manually by using SQL code created by programmers.&lt;br /&gt;&lt;br /&gt;This task was tedious and cumbersome in many cases since it involved many resources, complex coding and more work hours. On top of it, maintaining the code placed a great challenge among the programmers.&lt;br /&gt;&lt;br /&gt;These difficulties are eliminated by ETL Tools since they are very powerful and they offer many advantages in all stages of ETL process starting from extraction, data cleansing, data profiling, transformation, debuggging and loading into data warehouse when compared to the old method.&lt;br /&gt;There are a number of ETL tools available in the market to do ETL process the data according to business/technical requirements.&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;Following are some those.&lt;/strong&gt;&lt;br /&gt;&lt;strong&gt;&lt;span style="color:#660000;"&gt;&lt;/span&gt;&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;&lt;div align="center"&gt;&lt;span style="font-size:130%;color:#660000;"&gt;&lt;strong&gt;&lt;em&gt;&gt;&gt;Next&gt;&gt;&lt;/em&gt;&lt;/strong&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/1305275264264675951-5930306773349194016?l=dwhcareer.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://dwhcareer.blogspot.com/feeds/5930306773349194016/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=1305275264264675951&amp;postID=5930306773349194016' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/5930306773349194016'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/1305275264264675951/posts/default/5930306773349194016'/><link rel='alternate' type='text/html' href='http://dwhcareer.blogspot.com/2008/05/what-are-etl-tools-etl-tools-are-meant.html' title=''/><author><name>DataWarehousing Experts</name><uri>http://www.blogger.com/profile/07332280175395072730</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry></feed>
