While many people think only organizations with large volumes of data can benefit from an ETL system, any company that has diverse data sources or legacy systems they would like to access can leverage ETL. An ETL system is not a database server and doesn’t create the data; it does however, enable you to access, verify, cleanse and organize that data in a robust data repository.
The data repository ETL builds can be anything from a single database server to a full data warehouse infrastructure. Once the data repository is built, the data can be used for anything from business intelligence reports to technical analysis to dashboards. Those reports, analytics and dashboards will enable your organization to more effectively leverage existing assets and enhance profitability.
The first step in an ETL system is getting to your existing data. Logi ETL provides integrated support for connecting to a wide variety of sources including relational databases, flat files, spreadsheets, Web services, Web pages, and even the new "Web 2.0" data stores (Google, Amazon SimpleDB, Salesforce.com, etc).
Once the data has been extracted, an ETL system applies any required data validation, cleansing, and format manipulation. While some sources may require little or no processing, others may require some transformation. Logi ETL includes a wide assortment of transformation features including validation, sorting, filtering, aggregation, grouping, calculation, code translation, and joining to name a few.
Now that the data has been verified, cleansed, and formatted for the destination system, it is ready for loading. Logi ETL can load data into a wide variety of database systems including Oracle, MySQL, Microsoft SQL Server, PostgreSQL, DB2, Informix, Sybase, and any OLEDB or ODBC system. It also includes many options for optimizing the loading based on your IT infrastructure and even includes a high speed bulk loading option.
To boost productivity, Logi ETL employs an elemental development methodology like Logi Studio, Logi Info’s development environment. This means that building ETL jobs and tasks is accomplished through a library of pre-built, reusable elements that require no coding on the part of the developer. |
|