AVRO & ORC File-format Implementation In IBM Information Server 11.5

AVRO & ORC File-format Implementation in the File Connector Stage in Information Server 11.5

In my earlier posts on the File connector stage in DataStage (Information Server 11.5), I discussed how it can read files from and write files to a local file system on the engine tier, or to a Hadoop Distributed File System (HDFS) through the WebHDFS API or the HttpFS API.

Because the File connector reaches HDFS through the WebHDFS and HttpFS REST APIs rather than through distribution-specific client libraries, it is independent of the HDFS distribution and version.

  • The File connector supports Kerberos authentication.
  • The File connector supports creating Hive tables.
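To make the WebHDFS/HttpFS point concrete, here is a minimal Python sketch of the kind of REST URL these APIs expose. It is not what the File connector runs internally (that is not documented here); it only illustrates why a REST endpoint decouples the client from the Hadoop distribution. The host names, ports, file path and user name are hypothetical placeholders, and `webhdfs_url` is a helper name made up for this example.

```python
from urllib.parse import quote, urlencode


def webhdfs_url(host, port, hdfs_path, op, **params):
    """Build a WebHDFS-style REST URL.

    HttpFS serves the same /webhdfs/v1 path, so only host and port differ.
    """
    query = urlencode({"op": op, **params})
    return f"http://{host}:{port}/webhdfs/v1{quote(hdfs_path)}?{query}"


# Hypothetical endpoints; substitute the values for your own cluster.
print(webhdfs_url("namenode.example.com", 50070, "/data/sales.avro", "OPEN"))
print(webhdfs_url("httpfs.example.com", 14000, "/data/sales.avro", "OPEN",
                  **{"user.name": "dsadm"}))
```

The same function serves both gateways, which is the practical meaning of "independent of distribution and version": the client only needs an HTTP endpoint.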

In this post I show how to use the two new file formats added to the File connector: Avro (with Deflate, Snappy and Bzip2 compression) and ORC (with Zlib, Snappy and LZO compression).
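For quick reference when planning a job, the format and codec combinations named above can be captured in a small lookup. This is only a sketch in Python: the table contains just the codecs listed in this post (it says nothing about uncompressed output or other codecs), and `SUPPORTED_CODECS` and `is_supported` are hypothetical names, not part of any DataStage API.

```python
# Codecs per format, exactly as listed in this post for Information Server 11.5.
SUPPORTED_CODECS = {
    "avro": {"deflate", "snappy", "bzip2"},
    "orc": {"zlib", "snappy", "lzo"},
}


def is_supported(file_format, codec):
    """Return True if the post lists this codec for the given file format."""
    return codec.lower() in SUPPORTED_CODECS.get(file_format.lower(), set())


print(is_supported("Avro", "Bzip2"))
print(is_supported("ORC", "bzip2"))
```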

All of the capabilities above, including the new file formats, require “zero coding” and are configured through the graphical interface in DataStage Designer.

The detailed document is available on request; please message me on LinkedIn.

Download link for AVRO & ORC File-format Implementation In IBM Information Server 11.5 > http://bit.ly/1JZWfVi