Data Integration

Data Lakes – Is it Time for Your Business to Wade In?

As data continues to grow in both volume and structural variety, traditional relational database approaches fall increasingly short of providing the flexibility, agility, scalability, and economy needed to process it. Alternative and complementary approaches to managing information have been pioneered over the last few years, and have had time to mature, to satisfy today’s big data storage and processing needs. Most prominent among them, for centrally managing the onslaught of information a business needs to process and store, are Data Lakes.

What is the purpose of a Data Lake?

Data Lakes offer a far more economical and eminently scalable approach to ingesting and assimilating an ever-changing range of input data, primarily because they can be implemented on top of the open-source Hadoop ecosystem. Hadoop provides an architecture that can scale as needed by simply adding commodity servers to the cluster for increased parallel processing and storage. Due to […]

ThoughtSpot – For Near Instant Analytics Gratification

ThoughtSpot ups the ante when it comes to rapidly and effortlessly delivering insightful, completely ad hoc data analytics and visuals to your business, even for large, multi-terabyte data sets.

ThoughtSpot has blazed a trail in a new area of BI called Search BI. This type of BI differs from more established BI tools such as Tableau in that it embeds and applies knowledge about how data of different categories is generally analyzed and most effectively visualized. This knowledge is then mapped onto your business’s specific domain data. The alignment and cataloguing of the business domain data and metadata is then used to provide an optimized, intelligent, guided search capability over that data. A business user simply begins typing what they are looking for into the search box, and ThoughtSpot offers completions of the search as the user types. The suggested completions are offered in the order that […]

Technical Deep Dive in Informatica V10

Informatica V10 – Informatica Introduces New Release of Industry-Leading Data Management Platform Built for Modern Data Architectures.

With the launch of Informatica V10, Informatica PowerCenter, Informatica Data Quality, and Informatica Data Integration Hub lead the pack with enhanced agility and performance combined with new features:

  • Up to 50x faster data lineage generation
  • Up to 5x faster data ingestion and cleansing
  • Increased flexibility and agility for hybrid data architectures

Overview of new features in Informatica version 10:

  • Team-based development and version control support for the IDQ Model Repository Service.
  • Big Data Management Configuration Utility to automate part of the configuration process for Big Data Management.
  • Enhancements to Business Glossary for email notification, approval workflow, and import/export.
  • Various enhancements to command line programs.
  • Enhancements to the Informatica Admin Console providing a bird’s-eye view of domain and service health and detailed monitoring of job progress.
  • New profile UI and visualizations, with features to compare historical profile runs and detect outliers.
  • Support for profiling on XML/JSON […]