Skip to content

markobigdata

Big Data documentation in a blog

Search

Recent Posts

  • Fargate in AWS ECS with Terraform 12/08/2021
  • Docker, AWS, Python3 and boto3 17/12/2019
  • Using Python 3 with Apache Spark on CentOS 7 with help of virtualenv 11/12/2019
  • Nginx, Gunicorn and Dash on CentOS 05/12/2019
  • Automating access from Apache Spark to S3 with Ansible 27/09/2019
  • Zealpath and Trivago: case for AWS Cloud Engineer position 23/09/2019
  • Capturing messages in Event Hubs to Blob Storage 08/08/2019
  • Streaming messages from Kafka to EventHub with MirrorMaker 07/08/2019
  • Provision Apache Spark in AWS with Hashistack and Ansible 31/07/2019
  • Spark, Scala, sbt and S3 22/05/2019

Archives

Categories

  • Ambari (23)
  • Ambari Infra (2)
  • Ambari Metrics (2)
  • Ansible (4)
  • Apache (1)
  • Athena (1)
  • Avro (1)
  • AWS (11)
  • Azure (3)
  • benchmark (1)
  • Blob Storage (1)
  • Client (5)
  • Cloudformation (1)
  • Consul (4)
  • Dash (1)
  • DataNode (5)
  • Docker (9)
  • Druid (1)
  • ECS (1)
  • Event Hubs (2)
  • Fargate (1)
  • Flume (1)
  • fsck (2)
  • git (1)
  • GitHub (6)
  • GitHub Desktop (1)
  • Glue (1)
  • Grafana (3)
  • Gunicorn (1)
  • Hadoop 3 (1)
  • HDFS (8)
  • HDFS Snapshot (1)
  • HDP 2.6 (3)
  • Hive (3)
  • Hortonworks (41)
  • Infrastructure-as-Code (3)
  • Intellij IDEA (1)
  • Java (1)
  • Java 9 (1)
  • Jupyter (1)
  • Kafka (1)
  • Lambda architecture (3)
  • Machine Learning (1)
  • Marz (2)
  • maven (1)
  • MirrorMaker (1)
  • MRbench (1)
  • Namenode (2)
  • Nginx (1)
  • Notes (3)
  • Pig (2)
  • Powershell (1)
  • Python3 (3)
  • R (4)
  • Ranger (3)
  • S3 (4)
  • sbt (3)
  • Scala (5)
  • Scala IDE (1)
  • SerDe (1)
  • Solr (2)
  • Spark (1)
  • Spark 1.4.1 (3)
  • Spark 1.5.2 (4)
  • Spark 1.6.0 (11)
  • Spark 2.0 (7)
  • Spark 2.1.0 (1)
  • Spark 2.2.1 (1)
  • Spark 2.4.0 (1)
  • Spark Configuration (12)
  • Spark Summit (1)
  • Spark Summit East 2017 (1)
  • Spark Summit Europe 2016 (1)
  • Spark3.0.0 (1)
  • SparkContext (5)
  • sparkR (5)
  • SparkSession (3)
  • Storm (4)
  • TensorFlow (1)
  • Terraform (5)
  • TestDFSIO (1)
  • Tez (1)
  • Thrift (2)
  • Upgrade (5)
  • virtual environment (2)
  • Virtual Machine (1)
  • Visual Studio Code (1)
  • VPC (1)
  • WARN ServletHandler: /api/v1/applications (1)
  • Workaround (2)
  • YARN (2)
  • Zeppelin (4)
  • ZeppelinR (2)

Category: Spark Summit East 2017

Notes from Spark Summit East 2017

The Spark and Hadoop summits are one of my most valuable resources for keeping up to date with these technologies. Over 100 talks at this summit. So far, I have watched almost 60 of them and made short notes with print screens.

Hope someone else finds it useful as well.

Spark Summit East 2017

I ll update the file accordingly.

 

By markobigdatain Notes, Spark Summit, Spark Summit East 201714/05/201715/05/201758 WordsLeave a comment
Blog at WordPress.com.
  • Follow Following
    • markobigdata
    • Already have a WordPress.com account? Log in now.
    • markobigdata
    • Customize
    • Follow Following
    • Sign up
    • Log in
    • Report this content
    • View site in Reader
    • Manage subscriptions
    • Collapse this bar