Big Data Analytics

Anything about data analytics

Problem: Sampling data from an Event Hub input of Azure Stream Analytics job failed with the following error message: “No events found for ‘<input-name>’. Start time: <Day>, <Month> <Date>, <Year>, <Local Time> End time: <Day>, <Month> <Date>, <Year>, <Local Time> Last time arrival: <Day>, <Month> <Date>, <Year>, <Local Time> Diagnostics: While sampling data, no data […]

Read more

The final part, yay!! Open a web browser and connect to the DNS name of Ambari Server. Use port 8080. For example: http://<xx.xx.xx>.compute-1.amazonaws.com:8080. Use user/password admin/admin. Click on Launch Wizard. Give the name of your cluster. Click Next. Select the stack version you would like to deploy. Click Next. Under Install Options page, enter all […]

Read more

In this part, we will install and start Ambari server. Copy the private key (.pem) file from your PC to the Ambari Server. Save the pem file as id_rsa under directory /home/<user>/.ssh/ in the Ambari server. SSH to the Ambari server. Add the ambari repository into the repository sources. The example below uses version 2.2.2. […]

Read more

Let’s continue where we’re left off! Connect to the instance by ssh. Turn off the Transparent Huge Pages. Add below lines in the /etc/rc.local file. You need to sudo the text edit command (e.g. sudo nano /etc/rc.local)  if test -f /sys/kernel/mm/transparent_hugepage/enabled; then echo never > /sys/kernel/mm/transparent_hugepage/enabled fi if test -f /sys/kernel/mm/transparent_hugepage/defrag; then echo never > […]

Read more

Visualization & Data Source Baseball Player Data Visualization Data Source Source Code Summary This data visualization is the project I worked on during my Udacity Data Analysis nanodegree. This interactive and animated visualization shows a number of summary statistics based on individual baseball players. The animated charts would give readers insights into the correlation between […]

Read more
Azure Stream Analytics Cross Apply GetElements

Problem: How can I extract data from JSON array in the Azure Stream Analytics query? Answer: Let’s say we have a simple JSON input data like below. [ { ‘DeviceId’:’TXE012345′, ‘Time’:’2018-10-15T16:01:00.0000000Z’, ‘Temp’:70, ‘Humidity’:10, ‘AreaCount’:12, ‘ActType’:2, ‘ColInfo’:[ { ‘X’:2, ‘Y’:4, ‘Z’:6, ‘A’:8, ‘B’:10 } ], ‘sensor_01’:7, ‘sensor_02’:139, ‘sensor_03’:15, ‘sensor_04’:330 } ] And we want to have […]

Read more