About Krishna Kodeboyena

So far Krishna Kodeboyena has created 26 blog entries.

What’s the Point of Data Governance?


If you are in information technology, you have no doubt heard the term “data governance” bandied about more and more. The very sound of “data governance” conjures up images of layers of bureaucracy and regulatory minutiae. If you are in the ranks of development or change control, all you can imagine from this term is one more layer of regulations to navigate before a simple data-related change request can be approved and released.

Maybe your industry mandates the existence of said “data governance” to satisfy some compliance requirements, or maybe the mere fact that other companies are creating their own “data governance” programs has led the powers that be at your organization to decide that yours must have one too. Whatever the reason for its inception at your organization, what is the point of “data governance”? What does it really mean?

At its best, data governance provides […]

Know about Snowflake – Your Data. No Limits.


Snowflake is a data warehouse built for the cloud. It addresses problems that legacy on-premises data warehouses and earlier cloud data platforms were not designed to handle. Snowflake works with leading data management, data integration and BI partners to bring all data together and enable users to perform cutting-edge analytics.

Snowflake is positioned as the first analytical database built to leverage the power of the cloud. Adopting Snowflake is simple, and it offers strong performance and concurrency. It supports a distributed architecture, data protection and query resiliency, and it maintains fault tolerance. In addition, Snowflake services run on public cloud infrastructure.

Snowflake’s architecture is divided into three layers:

  1. Cloud services
  2. Virtual warehouses
  3. Database storage

Functionality of Cloud Data Warehouse

A data warehouse is essentially a relational database designed for query and analysis rather than for transaction processing. It typically holds historical data derived from transaction data.
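As a rough illustration of the query-and-analysis workload described above, here is a minimal sketch that uses Python's built-in sqlite3 module as a stand-in for a warehouse. The table name sales_history, its columns and the sample rows are all hypothetical, and nothing here is Snowflake's own API. It only shows the kind of aggregate query over historical data that a warehouse is designed to answer.

```python
import sqlite3

# In-memory database standing in for a warehouse (assumption: illustration only).
conn = sqlite3.connect(":memory:")

# Hypothetical table of historical records derived from transactions.
conn.execute(
    "CREATE TABLE sales_history (sale_date TEXT, region TEXT, amount REAL)"
)
conn.executemany(
    "INSERT INTO sales_history VALUES (?, ?, ?)",
    [
        ("2014-03-01", "East", 100.0),
        ("2014-07-15", "East", 50.0),
        ("2014-09-09", "West", 70.0),
        ("2015-01-20", "East", 30.0),
    ],
)


def yearly_totals(connection):
    """Return (year, region, total amount) rows, an analytical summary
    rather than a lookup or update of a single transaction."""
    query = """
        SELECT substr(sale_date, 1, 4), region, SUM(amount)
        FROM sales_history
        GROUP BY substr(sale_date, 1, 4), region
        ORDER BY 1, 2
    """
    return connection.execute(query).fetchall()


for year, region, total in yearly_totals(conn):
    print(year, region, total)
```

A transactional system would mostly insert or update one row at a time. The query above instead scans and summarizes the accumulated history.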


Back Office Process Automation

What’s new in Back Office Process Automation?

This is an entirely new approach to accomplishing tasks without human intervention. The project employed “Robotic Process Automation” (RPA) to carry out tasks with high accuracy. With RPA, a computer or machine takes the place of a human in a digital workforce environment. More precisely, automation bots operate on the presentation layer of any desktop or web application and execute multistep processes around the clock without errors. The objective of implementing RPA technology is to achieve greater business agility with minimal human effort and to create an error-free environment. Employing automation in the daily work environment can greatly improve overall efficiency and increase company productivity.


Are process automation tools limited to certain industries or products?

No! Since robotic process automation can interact with all the interfaces, it’s […]


Informatica, the world’s leader in the ETL market and the number one independent software provider, officially released Informatica version 10 on October 13, 2015, calling it a transformative innovation for the future of ‘all-things-data’.

Informatica v10 is an award-winning data management platform designed to modernize and boost the performance of enterprise data architectures. With enhanced agility and performance combined with new features, Informatica v10 delivers accurate data at nearly any speed across today’s hybrid IT environments, both cloud and on-premises.

Anil Chakravarthy, CEO and former chief product officer of Informatica Corp., in his own words:

“With Informatica v10, architects can modernize their IT environment by leveraging top performance and improved data governance for modern data warehouses, along with the flexibility of full Cloud support via a data hub, and increased end-to-end data integration agility.”

If you would like to read more about the article, click the link here: Informaitca_v10

Big Data Trends in 2016

It is 2016, and data is growing more rapidly than ever. 2015 was big data’s year, with many conferences on big data held everywhere. Professionals working in industries such as healthcare, insurance and banking were eager to learn more about big data, either to solve their big data problems or to explore its potential.

If you would like to read more about the article, click the link here: TrendsInBigData2016

HBase Data Extraction

Our client is an NE-based data solution provider in the healthcare industry. The client manages a single-node CDH5 cluster (version 5.3.2) on Ubuntu (Trusty Tahr). The client had two main concerns. The first was extracting data from HBase. Each table in HBase has its own metadata file, which provides information about the table, including which columns to include in and exclude from the output. The second was converting the output data to JSON format.

Pig is used to extract the data from HBase. Several approaches to interacting with HBase were tried at first. After exploring these options, Pig proved to be an apt solution for this project because of its built-in functions and its flexibility with user-defined functions (UDFs). Only one UDF, written in Python, is used in this project.
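The project's actual Pig script and Python UDF are not shown in this excerpt. As a minimal sketch of the two steps described above (filtering columns according to a table's metadata, then emitting JSON), the following plain-Python example may help. The metadata layout with "include" and "exclude" keys, the function names select_columns and row_to_json, and the sample row are all assumptions made for illustration, not the client's real format.

```python
import json


def select_columns(row, include=None, exclude=None):
    """Keep only the columns the metadata allows.

    row is a dict mapping "family:qualifier" to a value.
    include and exclude are optional lists of column names (hypothetical layout).
    """
    excluded = set(exclude or [])
    if include:
        # An explicit include list is honored first; excluded columns are still dropped.
        return {c: row[c] for c in include if c in row and c not in excluded}
    return {c: v for c, v in row.items() if c not in excluded}


def row_to_json(row_key, row, metadata):
    """Serialize one filtered row as a JSON string."""
    kept = select_columns(row, metadata.get("include"), metadata.get("exclude"))
    return json.dumps({"row_key": row_key, "columns": kept}, sort_keys=True)


# Hypothetical metadata and row, for illustration only.
metadata = {
    "include": ["info:name", "info:dob", "visit:date"],
    "exclude": ["info:dob"],
}
row = {
    "info:name": "Jane Doe",
    "info:dob": "1980-01-01",
    "visit:date": "2015-06-01",
    "visit:notes": "n/a",
}

print(row_to_json("patient-001", row, metadata))
```

In a Pig pipeline, logic like this would normally sit inside a registered Python UDF and be applied to each tuple loaded from HBase. The standalone form here keeps the sketch runnable without a cluster.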

If you would like to read more about the article, click the link here: HbaseExtractionFinal


The IT industry has an underlying assumption that projects will miss their deadlines or run over budget. Often, the projects that are completed do not perform the way we expect.

Many times these problems exist because of a lack of communication between the various project teams.

DevOps was first introduced at the 2008 Agile Conference by Andrew Clay Shafer and Patrick Debois. DevOps is a cultural change that encourages communication and collaboration between software developers and information technology professionals. DevOps helps optimize the SDLC. Its biggest advantage is that software builds, tests and releases happen frequently and rapidly.

To read more about the article, please click the link: DevOps

Hospital Data Comparison – Tableau BI Report

Hospital Compare has information about the quality of care at over 4,500 Medicare-certified hospitals across the country. We used this information to analyze which hospitals are good for particular conditions. The information is also useful to hospitals in making sound decisions about the quality of the care they provide. When you have a life-threatening emergency, always go to the nearest hospital. However, if you’re planning to have surgery, or if you have a condition like heart disease and know you may need hospital care in the future, it is worth comparing hospitals in advance. Research shows that some hospitals do a better job of taking care of patients with certain conditions than other hospitals.

To read more about the report, please click the link: HospitalCompareBI

To interact with this Tableau report, please go to:


Vizable: Tableau’s New Mobile App

Vizable is a free mobile app that extends Tableau from the PC to mobile devices. It is aimed at ordinary end users and executives, simplifying the process of digging into data and helping users explore data by seeing, touching and eventually understanding it.
Here is the layout of the Vizable interface, which is clean and user-friendly. Looking at the Vizable view, we can easily see that it has two basic chart forms: the bar chart and the line chart.

Vizable appears to be designed for lightweight data and for different purposes than Tableau. Developing Tableau dashboards is more about business strategy and visual variety. Vizable, on the other hand, tends to give end users a general analytic picture of their own data in order to answer simple analytic questions. For example, how is my workout performance? How are my truck drivers driving? How are my assets affected by my purchasing behavior? By inputting those […]

SnapLogic – Big Data Integration

Snap In To Data, Apps and APIs

SnapLogic is focusing on the framework required to implement strong data management in addition to integration. Enhancing self-service components through better task management and overall Snap use shows a strong commitment to providing organizations with a way to manage the data acquisition lifecycle through a reusable framework.


Change is inevitable, and a modern integration platform that is powerful yet easy to use and expand allows enterprise IT organizations to respond to change faster and future-proof their applications and data infrastructure.

SnapLogic is a unified data and application integration platform as a service (iPaaS). It offers more than 300 Snaps, which are prebuilt integration processes. The SnapLogic Elastic Integration Platform helps organizations connect faster and gain a better return on their cloud applications.

The use of Snaps to connect data highlights how SnapLogic provides self-service data integration to different LOB users. This helps organizations to […]