
Welcome to Blog by Jatin Madaan

Hive on Spark simple program

## PySpark code to run a SQL command. Code: ## Importing HiveContext >>> from pyspark.sql import HiveContext ## Create a SqlContext...

Load data into Hive table from a file on local system

To load data from a CSV file (it can be pipe-, tab-, or comma-separated): Step 1: Create a table with the same delimiter as the file. Command:...
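A minimal sketch of the idea in the excerpt above: the table's field delimiter has to match the file's delimiter before the file is loaded. The helper name `build_hive_load_statements`, the table name, the columns, and the file path are all hypothetical, and the use of `LOAD DATA LOCAL INPATH` as the follow-up step is an assumption, since the excerpt is cut off before it shows its own command. The function only builds the HiveQL strings; it does not run them.

```python
def build_hive_load_statements(table, columns, delimiter, local_path):
    """Build HiveQL to create a delimited text table and load a local file.

    columns is a list of (name, hive_type) pairs. All names used here are
    illustrative assumptions, not taken from the original post.
    """
    # A tab must be written as the two characters \t inside the HiveQL literal
    hive_delimiter = "\\t" if delimiter == "\t" else delimiter
    col_defs = ", ".join(f"{name} {hive_type}" for name, hive_type in columns)
    create_sql = (
        f"CREATE TABLE IF NOT EXISTS {table} ({col_defs}) "
        f"ROW FORMAT DELIMITED FIELDS TERMINATED BY '{hive_delimiter}' "
        "STORED AS TEXTFILE"
    )
    # LOCAL tells Hive to read the file from the local file system
    load_sql = f"LOAD DATA LOCAL INPATH '{local_path}' INTO TABLE {table}"
    return create_sql, load_sql


# Hypothetical pipe-separated file with two columns
create_sql, load_sql = build_hive_load_statements(
    "employees", [("id", "INT"), ("name", "STRING")], "|", "/tmp/employees.csv"
)
print(create_sql)
print(load_sql)
```

The two strings could then be passed to `hive -e` or to a Hive-enabled Spark session.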

Accessing Oracle using PySpark

To run Oracle commands on an Oracle server using PySpark. For EMR, first install the software: run sudo su, then pip install cx_Oracle==6.0b1 Function 1:...

AWS S3 file copy

To copy files to the local machine, we can use the command: aws s3 cp s3://bucket_name/folder_name/file_name.txt . Note that there is a dot at the end to...

Hadoop fs commands on S3

We can run almost all hadoop fs commands on the S3 file system as well. E.g.: hadoop fs -du -s -h s3://bucket_name/folder_name 10.1 G ...

Hive SQL return code check

When running a Hive query with hive -e or hive -f, merely writing rc=$? below the hive command will not help; it will only tell if...
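As a rough illustration of checking the outcome of a `hive -e` call, the sketch below launches the command from Python and returns the exit status together with whatever the process wrote to stderr, so both can be inspected. This is not necessarily the approach the post goes on to describe, since the excerpt is truncated. The function name `run_hive_query` and its `runner` parameter are hypothetical, and the sketch assumes the `hive` CLI is on the PATH.

```python
import subprocess


def run_hive_query(query, runner=("hive", "-e")):
    """Run a query through the given command and return (exit_code, stderr).

    runner is the command prefix placed before the query; the default
    assumes the hive CLI is installed (an assumption of this sketch).
    """
    completed = subprocess.run(
        [*runner, query],
        capture_output=True,  # collect stdout and stderr instead of printing
        text=True,            # decode output as str rather than bytes
    )
    return completed.returncode, completed.stderr
```

A caller might treat a nonzero exit code, or an error message in the captured stderr, as a failure and stop the surrounding job.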

AWS key for Terminal on Mac

To connect to an AWS cluster (EMR or EC2) via the terminal on a Mac, first make sure you download the pem file from your AWS account. Once the file has been...
