Volume 5, Number 43 -- November 1, 2005

Informatica Aims to Virtualize Data with PowerCenter 8


by Alex Woodie


There has been a lot of talk lately about virtualizing hardware and running multiple operating system images on the same machine to make it easier to manage applications. Informatica is now applying that general idea to data transformations with a new version of its flagship PowerCenter product unveiled this week. With PowerCenter 8, the company is looking to break down the barriers to building pipelines that connect physical repositories of data and applications accessing the data.

Informatica's PowerCenter product began life as an extract, transform, and load (ETL) tool for building data warehouses from data stored in the transactional systems of large corporations. While business intelligence is a booming area and continues to drive demand for ETL tools, Informatica is also seeing customers use PowerCenter in other ways, such as migrating to new ERP systems and building "competency centers" that set the standard in a company for how data is managed, often with regulatory compliance as the driving factor.

But whatever the need for a data transformation tool like PowerCenter, the amount and variety of data continue to explode, which means that PowerCenter needs to grow with them. This is what Informatica has sought to do with PowerCenter version 8, which has been in development for more than a year under the "Zeus" codename. The highlights of PowerCenter 8 can be broken down into three major new features: data virtualization, support for unstructured data, and grid enablement.

Data Virtualization

The new data virtualization capability is expected to help companies complete tough IT tasks, like quickly integrating new data following an acquisition. PowerCenter 7, like many other data transformation products on the market, enabled companies to physically move data from one place to another, like from a DB2/400 production system to a SQL Server-based data warehouse.

With the new data virtualization capability in PowerCenter 8, data can remain on the physical production system, while also appearing to reside on other systems, says Ivan Chong, product marketing manager with the Redwood City, California, company.

For example, one of the early testers of PowerCenter 8 was a bank that has a growth strategy that involves making acquisitions. "The challenge they had is the call center had to immediately provide an aggregate view of the new customers across all systems," Chong says. "With PowerCenter 8, the developer can specify the application once, and then change the mode by which it's accessed, physical or virtual."

Data virtualization provides customers with greater flexibility, Chong says. While having the data housed locally may provide better performance, physically moving the data from one system to another takes time and effort from IT professionals. By virtualizing the data across all supported data sources and targets, customers enjoy more choices in terms of how they access it.
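To make the "specify once, switch modes" idea concrete, here is a minimal Python sketch. It is not Informatica's API: the `Dataset` class, the `customer_view` mapping, and the field names are all hypothetical, chosen only to show one piece of mapping logic served either from a local copy (physical) or straight from the source (virtual).

```python
from typing import Callable, Dict, Iterable, List, Optional

Row = Dict[str, object]


def customer_view(rows: Iterable[Row]) -> List[Row]:
    """The mapping logic, written once: keep active customers, tidy names."""
    return [
        {"id": r["id"], "name": str(r["name"]).strip().title()}
        for r in rows
        if r["active"]
    ]


class Dataset:
    """Hypothetical wrapper exposing one mapping in two access modes."""

    def __init__(self, source: Callable[[], Iterable[Row]],
                 mapping: Callable[[Iterable[Row]], List[Row]]) -> None:
        self._source = source
        self._mapping = mapping
        self._copy: Optional[List[Row]] = None  # filled only in physical mode

    def materialize(self) -> None:
        """Physical mode: run the mapping now and keep a local copy."""
        self._copy = self._mapping(self._source())

    def read(self) -> List[Row]:
        """Serve the local copy if one exists; otherwise query the source."""
        if self._copy is not None:
            return self._copy
        return self._mapping(self._source())


# Stand-in for a production table; in practice this would be a database query.
production = [
    {"id": 1, "name": " ann lee ", "active": True},
    {"id": 2, "name": "bob", "active": False},
]
customers = Dataset(lambda: production, customer_view)
virtual_rows = customers.read()   # virtual: reads the source at call time
customers.materialize()           # physical: snapshot taken here
physical_rows = customers.read()  # served from the local copy
```

The trade-off Chong describes shows up directly: the virtual read always reflects the current source but pays the access cost on every call, while the physical copy is fast to read but stale until it is rebuilt.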

Data Federation

Not all pertinent data is stored in relational databases or other structured forms of information, such as XML. In fact, up to 90 percent of the information that companies use resides in unstructured or semi-structured formats, such as Word documents, Excel spreadsheets, Web pages, e-mails, HIPAA documents, and PDF documents, according to Informatica.

With the amount of unstructured and semi-structured data increasing dramatically, Informatica felt the need to increase this area of support within PowerCenter 8. Users can bring their unstructured data into the PowerCenter fold by using the product's parsing designer, which basically lets them do one-off mappings of certain specified documents.

For example, say a given company's business process requires it to access a credit report before clearing an order, Chong says. Building the pipelines that enable a user to quickly assemble the documents necessary to form a decision, including the Dun & Bradstreet credit report, is easier with PowerCenter 8, he says.

Grid Enablement

PowerCenter 8 also includes new grid computing capabilities that will benefit users by making high availability a built-in feature of the product. While grid features were supported in the previous release, PowerCenter 8 includes more intelligence that enables the product to more effectively handle the grid resources users have allotted to their PowerCenter implementation.


Grid infrastructures make it easy for users to scale up on the hardware side by simply plugging another blade into the system, Chong says. "The problem is software hasn't scaled with the grid. A customer has to specify in applications where processing is to be applied. Machines come and go, data volumes spike, and every time, a developer must decide how to rework it," he says. "With PowerCenter 8, we allow developers to specify logic, but PowerCenter takes responsibility for detecting and exploiting where processing takes place." In short, this means that PowerCenter 8 now has built-in failover capabilities.
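The separation Chong describes, where the developer writes the logic and the runtime decides where it runs, can be illustrated with a small Python sketch. This is only an analogy: it uses a thread pool on one machine rather than a grid, and the `cleanse` and `run` functions are hypothetical names, not part of PowerCenter.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import List


def cleanse(record: str) -> str:
    """The developer's logic: it says nothing about where it will run."""
    return record.strip().upper()


def run(records: List[str], workers: int) -> List[str]:
    """The runtime's job: spread the same logic over however many workers exist."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Executor.map returns results in input order, whatever the worker count.
        return list(pool.map(cleanse, records))


records = [" alpha", "beta ", " gamma "]
small = run(records, workers=1)  # one worker available
large = run(records, workers=4)  # capacity added; cleanse() is unchanged
```

Changing `workers` alters how the work is spread out, but not the transformation itself or its result.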

PowerCenter 8's new grid awareness has been put to the test at LinkShare, a company that connects e-commerce Web sites with advertising, and maintains a multi-terabyte data warehouse on a Linux DB2 cluster. "They did a beta test of our grid capability and verified the automated parallelization of grid infrastructure is in line with their requirements," Chong says.

In addition to these new features, PowerCenter 8 received several other enhancements, including the new "push-down" capability that enables data transformation processing to be sent to the target relational database, instead of being performed by the PowerCenter server. This release also brings new support for importing, creating, compiling, and debugging transformation routines that have been written in Java (previous releases relied entirely on visual templates, Chong says), as well as a new Web-based console designed to automate repetitive administrative and performance-tuning tasks.
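The general idea behind push-down can be shown with a short Python sketch using the standard-library `sqlite3` module. This is not how PowerCenter implements the feature; the `orders` table and both function names are hypothetical. It simply contrasts aggregating rows inside the tool with sending the same aggregation to the database as SQL.

```python
import sqlite3
from collections import defaultdict
from typing import Dict


def totals_in_engine(conn: sqlite3.Connection) -> Dict[str, float]:
    """Pull every row out of the database and aggregate in this process."""
    totals: Dict[str, float] = defaultdict(float)
    for region, amount in conn.execute("SELECT region, amount FROM orders"):
        totals[region] += amount
    return dict(totals)


def totals_pushed_down(conn: sqlite3.Connection) -> Dict[str, float]:
    """Send the aggregation to the database and fetch only the result rows."""
    sql = "SELECT region, SUM(amount) FROM orders GROUP BY region"
    return dict(conn.execute(sql).fetchall())


# Hypothetical sample data in an in-memory database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [("east", 10.0), ("west", 5.0), ("east", 2.5)],
)
```

Both functions return the same totals. The difference is that the pushed-down version moves one row per region across the connection instead of one row per order, which matters when the table is large.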

While PowerCenter agents support many different targets and sources of data, including OS/400 applications and DB2/400 data, the PowerCenter engine installs on Windows, AIX, HP-UX, Solaris, Linux, and z/OS servers. The company has considered a native OS/400 port but currently has no plans to finalize such a product.

PowerCenter 8 should be available on a limited First Customer Ship (FCS) basis in December, with general availability planned for April 2006. The company goes through an FCS stage to ensure that real-world guidelines and reference implementations are in place before widespread adoption. The standard edition of PowerCenter 8 will start at $140,000, while customers will pay $40,000 more for the Advanced Edition, which is required for grid enablement and brings additional reporting, metadata analysis, and team development functions.

PowerCenter 8 marks the completion of the second stage of a three-part roadmap Informatica unveiled in February (see "Informatica Unveils 18-Month Roadmap for Enterprise ETL"). The final stage of that plan involves the development of "Hercules," the codename for the next release of PowerCenter. Initially slated for a fall 2006 release, Hercules, which is focused on completing the product's SOA story, has been pushed back to 2007, Chong says.


This article has been corrected since it was first published. Two spelling errors were corrected. IT Jungle regrets the errors. [Corrections made 11/01/05.]



Editor: Alex Woodie
Contributing Editors: Dan Burger, Joe Hertvik,
Shannon O'Donnell, Timothy Prickett Morgan
Publisher and Advertising Director: Jenny Thomas
Advertising Sales Representative: Kim Reed



Copyright © 1996-2008 Guild Companies, Inc. All Rights Reserved.
Guild Companies, Inc. (formerly Midrange Server), 50 Park Terrace East, Suite 8F, New York, NY 10034