Jatin MadaanMar 10, 20191 min readAccessing Oracle using PySpark . To run oracle commands on oracle server using pyspark . For EMR First install software sudo su pip install cx_Oracle==6.0b1 Function 1 :...
Jatin MadaanMar 10, 20191 min readAWS S3 file copy To copy files on local machine we can use command : aws s3 cp s3://bucket_name/folder_name/file_name.txt . there is a dot at end to...
Jatin MadaanMar 10, 20191 min readHadoop fs commands on S3 We can perform almost all hadoop fs commands on s3 file system as well. Eg : hadoop fs -du -s -h s3://bucket_name/folder_name 10.1 G ...
Jatin MadaanFeb 7, 20191 min readAWS key for Terminal on Mac To connect aws cluster (EMR or EC2) via terminal on mac . First make sure you download pem file from aws account. Once file has been...