Oct 23, 2012

SyntaxHighlighter is finally configured

I have tried to set up syntax highlighting on my blog several times. Some approaches worked for a while, others didn't. Finally I decided to install and configure SyntaxHighlighter.
Here is a link to the official site: http://alexgorbatchev.com/SyntaxHighlighter/
There is a brief and generic description of how to set up this JavaScript library.
I have also found a number of very old and actually incorrect guides on how to integrate Blogger with SyntaxHighlighter. None of them gave me what I was looking for, so I had to spend a bit more time figuring it out.

Oct 11, 2012

Hadoop Hive: How to keep your data safe

You usually keep a lot of useful data in your Hadoop cluster, and you really don't want to lose it. Apache Hive is a very useful tool for accessing and querying that data. Hive requires a schema definition to parse the data stored as files on HDFS, so in a nutshell you have to create a Hive table on top of your files. It is possible to drop this table accidentally, and by default all of its data files are deleted as well. This can cause a great deal of trouble for your data warehouse. Unfortunately, Hive doesn't have sufficient security to protect your data. On the other hand, it has some facilities that could (and should) be used.

Oct 9, 2012

Big Data Landscape

Scripting in Bash: Convert string to array

This is a simple example of how to convert a multi-line string into an array.
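
The excerpt does not show the script itself, so here is a minimal sketch of one way to do it, assuming Bash 4 or newer (for the `mapfile` builtin). The variable names `text` and `lines` and the sample input are made up for illustration.

```bash
#!/usr/bin/env bash

# Hypothetical sample input: three lines held in a single string.
text=$'first line\nsecond line\nthird line'

# mapfile (also available as readarray) reads stdin into an indexed array,
# one element per line. -t strips the trailing newline from each element.
# The here-string feeds the variable to mapfile without a subshell,
# so the array stays visible in the current shell.
mapfile -t lines <<< "$text"

echo "count: ${#lines[@]}"

# Iterate over the array indexes and print each element.
for i in "${!lines[@]}"; do
  echo "$i: ${lines[$i]}"
done
```

Quoting `"$text"` matters here: unquoted, the shell would word-split the value before `mapfile` ever sees it. On older Bash versions without `mapfile`, a `while IFS= read -r line` loop that appends to the array is a common substitute.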