Loading…
Thanks for a great Analytics and Data Summit 2019 event. We hope to see you next year Feb 25-27, 2020!
To download the presentations you must first sign up with Sched for free (easy and fast!)
  • Sessions appear in the color of their primary track and can be filtered using Products on the right
  • Use the Search bar for more flexibility
See this link for hints on how to search the schedule


Sign up or log in to bookmark your favorites and sync them to your phone or calendar.

Big Data [clear filter]
Tuesday, March 12
 

10:10am

Generation Two: Data Lakes and Data Science on Oracle Cloud Infrastructure
Have you thought about how you will enable your AI strategy to work hand-in-hand with the full power of your enterprise data assets? Has it been aligned with a cloud first data lake and data management strategy? Have you thought through how it will be economically efficient - truly elastic - while also being performant, secure and highly available? Join this session to see how Oracle has taken a new integrated approach to a cloud native data lakes and data science on Oracle Cloud Infrastructure. Through a combination of product development updates, live demonstrations and customer stories, see how Oracle is building out a cloud native, open and enterprise class set of services for data lakes and data science on Oracle Cloud Infrastructure.

Speakers
avatar for Mike Lehmann

Mike Lehmann

VP Product Management, Oracle, Big Data and Data Science
Mike Lehmann is Vice President of Product Management for Oracle Cloud Infrastructure leading a team responsible for developing and delivering a collection of modern Data Lake and Data Science cloud services. Prior to this role, Mike led product teams responsible for building out a... Read More →


Tuesday March 12, 2019 10:10am - 11:00am
3-Rm 104

11:15am

Oracle's Machine Learning Overview, New Features and Road Map
Oracle has been embedding machine learning algorithms into the SQL kernel of the Oracle Database for many years. Oracle Advanced Analytics 18c delivers significant performance gains and new algorithms including Random Forests, Neural Networks, Unsupervised Feature Selection, Partitioned models and Market Basket Analysis improvements. Data scientists can build models in-database on billions of records in minutes and “scores” millions of records in seconds. Now, Oracle Machine Learning, bundled in the Oracle Autonomous Database, adds collaborative machine learning notebooks for data scientists. Oracle also has added a Python API (OML4Py), ORAAH and new Cognitive Analytics for images, text and deploying models as Microservices. Oracle “moves the algorithms to the data” for hybrid data management and machine learning platforms that enable enterprise-wide machine learning model deployment. Come hear about the latest developments and what’s coming next from Oracle.

Speakers
avatar for Charlie Berger

Charlie Berger

Senior Director, Product Management, Oracle
Charlie Berger is the Sr. Director of Product Management, Machine Learning, AI and Cognitive Analytics in Oracle Server Technology. Since 1999, he has been responsible for Oracle’s machine learning—starting when Oracle acquired Thinking Machines Corporation where he was VP of... Read More →


Tuesday March 12, 2019 11:15am - 12:05pm
1-Rm 102 350 Oracle Parkway, Redwood City, CA, United States

11:15am

NoSQL on Oracle Cloud Infrastructure
During this session we'll cover NoSQL solutions that are currently available on OCI, including MongoDB, DataStax and Couchbase. OCI is partnering closely with the leading vendors in this space to ensure that it is easy and simple for our customers to run whatever database they chose on Oracle's next generation cloud. We'll talk through the NoSQL ecosystem, covering open source, open core and close source products available in the ecosystem. All this is part of Oracle's broader commitment to building an open cloud. We'll discuss document stores, column stores and graph databases. We'll also demonstrate deploying a variety of these databases to OCI and discuss how a user can easily get started doing this on their own.

Speakers
avatar for Ben Lackey

Ben Lackey

Director, Data and AI Partnerships, Oracle
I lead a team focused on making it easy to run ISV workloads on Oracle Cloud Infrastructure (OCI). We partner with NoSQL vendors as well as AI companies like H2O.ai and DataRobot. Prior to Oracle I worked in cloud partnerships at DataStax and Couchbase. I've worked in a variety of... Read More →


Tuesday March 12, 2019 11:15am - 12:05pm
3-Rm 104
  • Field 2 general

1:10pm

An Oracle database approach to the Taxi Fare problem
The Taxi Fare prediction problem is a well known in machine learning (ML): we have to design, train and test a ML model to predict the taxi fare a customer of the NY City cab company will pay for a ride. This is a basic example used in several ML introductory courses and also ML and artificial intelligence (AI) websites as foundation for more advanced topics. In this presentation, we will work with the publicly available data to have it prepared to use the ML functions of Oracle Advanced Analytics embedded in Oracle Database and, more importantly, the new neural network algorithm introduced in 18c. We will see how the data was prepared, some powerful tools included in the Oracle Data Miner component of SQL*Developer, the PL/SQL code involved to create the model and, of course, some testing. All with a live demo.




Speakers
avatar for Jose Rodriguez

Jose Rodriguez

Oracle Project Engineer, Pythian
Jose Rodriguez is a project engineer at Pythian with deep expertise in a wide range of technologies including Oracle, SQL Server, DB2 and PostgreSQL databases. He is passionate about technology and is always looking for the latest tool that will enhance future projects, and deliver... Read More →


Tuesday March 12, 2019 1:10pm - 2:00pm
1-Rm 102 350 Oracle Parkway, Redwood City, CA, United States

1:10pm

Just do it! Learn how to query across Oracle Database, the Data Lake and Kafka
If you want to sit through another PowerPoint presentation - then this is not the right session for you. If you want to see how your existing applications that use Oracle SQL can now be extended with new insights - blending real time data from Kafka, unstructured data from your data lake and enterprise data from your Oracle Database - then this will be an interesting session! Oracle Big Data SQL is the enabling technology makes this possible - and we'll highlight new Big Data SQL capabilities that provide massively improved performance and deployment flexibility. Check out how it's done!

Speakers
avatar for Marty Gubar

Marty Gubar

Director PM, Oracle
I'm part of Oracle's big data product management team - with a key focus on Big Data SQL. I have a long history at Oracle - starting in the sales consulting organization - and then working in the BI, OLAP and Big Data PM groups


Tuesday March 12, 2019 1:10pm - 2:00pm
6-Rm 203

2:30pm

(HOL) Machine Learning 202: Oracle Autonomous Database & Machine Learning Notebooks
Try out the new Oracle Machine Learning Zeppelin-based notebooks that come with the Oracle Autonomous Database in this introductory Hands-On-Lab. Oracle Machine Learning extends Oracle’s offerings in the cloud with its collaborative notebook environment that helps data scientist teams build, share, document, and automate data analysis methodologies that run 100% in the Oracle Autonomous Database. Interactively work with your data, build, evaluate and apply machine learning models. Import, export, edit, run and share Oracle Machine Learning notebooks with other data scientists and colleagues all on the Oracle Autonomous Database. Share and further explore your insights and predictions using the Oracle Analytics Cloud.

Speakers
avatar for Tim Vlamis

Tim Vlamis

VP & Analytics Strategist, Vlamis Software Solutions
An Oracle Ace and expert in the visualization of data and the design of business analytics strategies, Tim combines a strong background in the application of Oracle-based business analytics and data mining with extensive experience in business modeling and valuation analysis. Tim... Read More →
avatar for Derrick Cameron

Derrick Cameron

Master Principal Solutions Engineer, Oracle Americas Inc.
Derrick Cameron is part of Oracle's Solution Engineering Developer Innovation Team, and is a 25+ year Oracle veteran. His career journey is varied, starting out as an EBS implementation consultant and moved on to develop deep expertise in Analytics and Data Management. Over the years... Read More →
avatar for Charlie Berger

Charlie Berger

Senior Director, Product Management, Oracle
Charlie Berger is the Sr. Director of Product Management, Machine Learning, AI and Cognitive Analytics in Oracle Server Technology. Since 1999, he has been responsible for Oracle’s machine learning—starting when Oracle acquired Thinking Machines Corporation where he was VP of... Read More →


Tuesday March 12, 2019 2:30pm - 3:20pm
5-Rm 202 HOL

4:20pm

The Roadmap for Big Data Cloud Service and Big Data Appliance
Driven by mega trends like AI and Cloud, Big Data platforms are evolving. But does this mean we need to throw away everything that is being used right now? This session discusses the impact mega trends have on the Big Data platforms and how you can ensure that you properly invest for a solid future. We will discuss how platforms like Big Data Appliance cater to these trends and spend some time sharing the roadmap for Big Data Cloud Service on Oracle Cloud Infrastructure. The combination of these products will enable customers and partners to build out a platform for Big Data, AI, both on premises and in Oracle Cloud, leveraging the synergy between the platforms and the trends to solve actual business problems.


Speakers
avatar for Jacco Draaijer

Jacco Draaijer

Vice President Software Development, Oracle
Jacco Draaijer is a Vice President in the Oracle Server Technology Division, leading Big Data development. His team is responsible for the Big Data product suite that consists of Oracle's Big Data Appliance, Big Data SQL and Big Data Cloud Service. Jacco has been with Oracle since... Read More →


Tuesday March 12, 2019 4:20pm - 5:10pm
6-Rm 203
 
Wednesday, March 13
 

10:05am

Analysis and Interpretation of 3D Point Clouds Based on Machine Learning
Today, landscapes, cities, and infrastructure networks are commonly captured using LiDAR or image-based remote sensing technologies at regular intervals. The resulting 3D point clouds are digital twins of the captured sites and provide valuable information. Hence, 3D point clouds have become essential for a growing number of geo-spatial applications such as urban planning, environmental monitoring, and disaster management.
In this talk, we present techniques to manage, process, and analyze 3D point clouds. We discuss the potential of AI and demonstrate how a machine-learning-based interpretation can be used to classify data, derive semantics and create insights. We show how the Oracle Cloud Infrastructure with GPUs and Database Spatial and Graph allows for handling and processing of massive amounts of data. The implemented approach opens up new ways for the full potential of large-scale, dense, and time-variant point clouds as required by a growing number of applications and systems.

Speakers
avatar for Rico Richter

Rico Richter

Dr., Hasso-Plattner-Institut / Uni Potsdam
Rico Richter has many years of experience in the development of complex processing, analysis and visualization techniques for 3D spatial data. He studied IT Systems Engineering at the Hasso Plattner Institute in Germany and has written his PhD thesis entitled “Concepts and Techniques... Read More →


Wednesday March 13, 2019 10:05am - 10:30am
2-Rm 103

10:05am

Query Kafka Streams with Oracle SQL
Managing streaming data is important to many data analytics applications. Enterprise applications need to work with IoT device feeds, clickstream data, social media feeds, and more. Oracle SQL access to Kafka streams gives SQL developers a simple and powerful way to work with streaming data platforms like Apache Kafka, a popular technology for managing streaming data. With this feature developers can use Oracle SQL to query streaming data, and integrate with data stored in the database. They can use the analytics features of Oracle SQL when working with streaming data, and use other familiar development tools. This session is a how-to on using Oracle SQL query to Kafka streams, join Kafka streams with database tables, and load high speed streaming data into the database. We will walk through use cases using Oracle Cloud services.

Speakers
avatar for Melliyal Annamalai

Melliyal Annamalai

Product Manager, Oracle
Melli Annamalai is a product manager at Oracle. She has vast experience in multiple technology areas related to unstructured and semi-structured data management. One of her current focus areas is Big Data, with an emphasis on integrating data from streaming sources and Hadoop environments... Read More →


Wednesday March 13, 2019 10:05am - 10:55am
3-Rm 104

11:10am

How to build a Data Lab: strategy and methodology
Everyone is talking about it (Artificial Intelligence, Machine Learning) but who is really doing it and what lessons have been learned along the way. This presentation provides a set of best practices for organizing, staffing, and measuring the outcomes of a data lab. Real-wold examples from Oracle customers will be shared as we look together at the people, processes and technology that lead to measurable business outcomes.

Speakers
avatar for Marcos Arancibia

Marcos Arancibia

Product Manager, Data Science and Big Data, Oracle
Marcos Arancibia is the Product Mgr for Oracle Data Science and Big Data, working with Machine Learning in the Oracle Database and on Hadoop/Spark clusters. He is charted with developing a comprehensive platform for enabling a new class of analytical workloads encompassing techniques... Read More →


Wednesday March 13, 2019 11:10am - 12:00pm
6-Rm 203
  • Field 2 general

11:10am

Analyzing binary data in Hadoop
While most data analysis is textual, this presentation discusses the use of Java Map-Reduce to extract the metadata from sonar files collected by UUV (Unmanned Underwater Vehicle) missions. The sheer volume of files generated makes this a good candidate for Hadoop divide and conquer. The geolocation and timestamp information is extracted and loaded into a Hive Database, followed by a Sqoop job to transfer the data to an Oracle Database. Oracle APEX (Application Express) is then used to visualize the information on a Google map. The motivation for this effort comes from a prior project that extracted event information from Windows Event logs (using Java Map-Reduce). In addition the depth information is also used with basic AI classification algorithms in Hadoop/Spark for prediction.





Speakers
AA

Abhinav Arya

Student, UC Berkeley
avatar for Arijit Das

Arijit Das

Research Associate, Naval Postgraduate School
Arijit Das is a computer scientist with BS/MS degrees in Computer Science (MNREC, India & Oregon State University) and an MSEE from University of Nevada Las Vegas. He currently is a research faculty at the Computer Science Department of the Naval Postgraduate School in Monterey, California... Read More →
PK

Pyojeong Kim

Student, Naval postgraduate school


Wednesday March 13, 2019 11:10am - 12:00pm
4-Rm 105

1:15pm

Analysis of Sensor Data Using Oracle Advanced Analytics
Filtering out noise and minimizing inaccuracies of sensor data - challenges in the data cleaning process - can be performed by anomaly detection using the Oracle OAAgraph package, which enables Oracle R Enterprise users to interact with Oracle Spatial and Graph's Parallel Graph Analytics (PGX) engine. In addition, selecting features from sensor data, which is vital in interpreting sensor data and extracting patterns, can be achieved by Oracle graph analytics. Both applications of Oracle Advanced Analytics will help researchers achieve their goals and lead to findings in more diverse domains.




Speakers
avatar for Beiyu Lin

Beiyu Lin

Graduate Research Assistant, Washington State University
Beiyu Lin is a Ph.D. candidate in Computer Science at Washington State University, focusing on data mining and machine learning. She constructs computational models based on smart home sensor data for the purpose of identifying routine behavior patterns and assessing behavior changes... Read More →


Wednesday March 13, 2019 1:15pm - 1:30pm
3-Rm 104

2:20pm

Solutions for bi-directional integration between Oracle RDBMS and Apache Kafka
Apache Kafka is a popular distributed streaming data platform. A Kafka cluster stores streams of records (messages) in categories called topics. It is the architectural backbone of modern data analytics. Data flowing into Kafka often originates from native data streams such as social media streams, telemetry data, financial transactions and many others. But these data streams only contain part of the information. A lot of data necessary in stream processing is stored in traditional systems backed by relational databases. To implement new and modern, real-time solutions, an up-to-date view of that information is needed. So how do we make sure that information can flow between the RDMBS and Kafka, so that changes are available in Kafka as soon as possible in near-real-time? It this session, we present different approaches for integrating relational databases with Kafka, such as Kafka Connect, Oracle GoldenGate and bridging Kafka with Oracle Advanced Queuing (AQ).


Speakers
avatar for Guido Schmutz

Guido Schmutz

Platform Architect, Trivadis
Guido Schmutz works for the Oracle Platinum Partner Trivadis. He has more than 30 years of technology experience. At Trivadis he leads the Trivadis Architecture Board. He has long-time experience as developer, coach, trainer, and architect in the area of building IT systems. In the... Read More →


Wednesday March 13, 2019 2:20pm - 3:10pm
4-Rm 105

3:25pm

Autonomous Data Warehouse the new Data Lake
Over the past several years Hadoop based Data Lakes have emerged in sometimes successful but often challenged implementations. While Hadoop is optimized for large data storage, it struggles to achieve the ad hoc performance that most companies need in a modern data platform, driving their Data Lake. Soft costs for Hadoop like finding qualified resources and total cost of ownership, have hurt Hadoop's ROI in the enterprise for all but the most extreme big data cases.

What has emerged from the Data Lake model is a fantastic pattern for engineering data that can totally satisfy most organizations. This pattern of Ingest, Model and Refine is the new basis for the modern data platform. If only there was a great architecture to deploy this pattern. Enter the Oracle Autonomous Data Warehouse, this cloud based platform with object storage capabilities is the ideal platform for your powerful Data Lake, with easy of maintenance, up time and low total cost of ownership.

Speakers
avatar for Jeremy Simmons

Jeremy Simmons

BI Practice Lead, Qubix Inc
A business and technology consultant with over 20 years of experience in delivering value to organizations by closing the gap between business needs and IT capabilities. I take a hands on approach to delivering technical work, with a strong ability to speak to and gather C-Level requirements... Read More →


Wednesday March 13, 2019 3:25pm - 4:15pm
6-Rm 203

3:25pm

Extreme Data Visualization to Drive User Adoption
Accelerate business insights through data discovery, analytics and comprehensive Visualization delivered through Oracle BI. This session focuses on LAUSD deployment of an advanced, custom UI/UX framework on Oracle BI stack to deliver extreme Visualization and a live demonstration on how we adopted a simplified reporting platform for line-of-business users on key metrics, dazzling visualization and infographics delivered through BI dashboards to help improve the decision-making process across the organization.



Speakers
avatar for Grace Peng

Grace Peng

Technical Specialist, LAUSD
I am IT executive with specialized skills in enterprise information management and analytics for LAUSD. As dynamic information technology leader and certified Project Management Professional with success in program/project management, problem solving, strategic planning, and operations... Read More →
GS

Gautham Sampath

Executive Director, Innive Inc


Wednesday March 13, 2019 3:25pm - 4:15pm
4-Rm 105

4:30pm

Customer Success with Autonomous Data Warehouse
Overview of at least 10 business use cases and value of Autonomous Data Warehouse (ADW). I will cover the early implementations of Autonomous Database why the companies chose it, how the implementation went, and the business value they are realising.

Speakers
avatar for Laura McKechnie

Laura McKechnie

Product Manager, Oracle
I'm in Oracle Product Management for Data Warehousing and Big Data, and run our Global Leaders Program for Americas, ie look after our key customers running our data management solutions!
avatar for Pete Schutt

Pete Schutt

Sr Director, Oracle
Pete Schutt is a Senior Director of Marketing in Oracle Cloud Business Group. He curates customer success stories in multimedia content assets for executive communication across all marketing and sales channels.


Wednesday March 13, 2019 4:30pm - 5:00pm
6-Rm 203
 
Thursday, March 14
 

9:50am

Cross-Validation & Model Selection in ML Classification: ORAAH and OAA examples
Cross-Validation is a great method to better estimate the expected quality of a model on future Scoring Datasets, and also to compare performance between models. From a Confusion Matrix generated for each model run, several useful statistics can be extracted, including Precision, Accuracy, Recall, True Positive Rate, F1 Score, Informedness, and others. In this session we will see an example of using k-Fold Cross-Validation to choose the best Model for several Examples, using the latest Oracle R Advanced Analytics for Hadoop (ORAAH) and Oracle Advanced Analytics (OAA). The sample R and SQL code will be explained, but familiarity with R or SQL is not crucial.

Speakers
avatar for Marcos Arancibia

Marcos Arancibia

Product Manager, Data Science and Big Data, Oracle
Marcos Arancibia is the Product Mgr for Oracle Data Science and Big Data, working with Machine Learning in the Oracle Database and on Hadoop/Spark clusters. He is charted with developing a comprehensive platform for enabling a new class of analytical workloads encompassing techniques... Read More →


Thursday March 14, 2019 9:50am - 10:40am
1-Rm 102 350 Oracle Parkway, Redwood City, CA, United States

9:50am

Innovative Data Protection for Big Data: Tackling GDPR and Other Data Requiremen
GDPR, HIPAA, and other regulation forces every organization to scrutinize their data protection strategies implemented on big data. Most solutions involve application security, which presents some serious issues. This session presents a completely new strategy: securing the file itself directly inside the file. This innovative way of securing file-based data ensures that any organization can store file data in a manner that is secure by default and compliant by default. Explore the details of the technology and see solution patterns and implementation strategies to ensure your big data repositories can comply with stringent protection regulations while delivering analytics at speed.


Speakers
avatar for Shrikumar Hariharasubrahmanian

Shrikumar Hariharasubrahmanian

Lead Researcher, Oracle Labs
Shrikumar is a lead researcher at Oracle Labs. His main research interests lie in the interplay between hardware, software, OS and cross-system integration, with a focus on improving the efficiencies of high performance communication using an approach guided by the principle of "De-Layering".Applying... Read More →


Thursday March 14, 2019 9:50am - 10:40am
6-Rm 203

10:55am

Machine Learning & AI Applications for Smart Manufacturing
Learn about the recently announced Oracle Adaptive Intelligent (AI) Applications for Manufacturing. This SaaS offering collects, stores, and analyzes massive amounts of Operational Technology (OT) data coming from sensor enabled shop floor equipment and machines and contextualizes it with Information Technology (IT) data coming from enterprise applications such as ERP, HCM, SCM, MES, Quality, LIMS and analyzes it by applying machine learning, and artificial intelligence techniques to discover key patterns and correlations that affect manufacturing efficiencies and provides actionable predictive analytics to maximize yield and minimize defects, scrap, rework, costs, returns. It also provides comprehensive capabilities for backward and forward tracing of products and processes within manufacturing and supply chain spanning manpower, machine, material, method and management related information to facilitate rapid root cause, impact and containment analysis.

Speakers
avatar for Gopal Ratnam

Gopal Ratnam

Senior Director, Product Management & Strategy, Oracle
TBD


Thursday March 14, 2019 10:55am - 11:45am
1-Rm 102 350 Oracle Parkway, Redwood City, CA, United States

10:55am

Self-diagnosis and Management of Performance and Availability for Autonomous Database
Autonomous databases signal considerable advancement from manual processes previously required by legacy databases that have been prevalent in previous decades. Removal of the human operator and as such, associated human error, from the control loop is the basic premise of autonomous databases. Self-driving and virtually infallible software agents may now take control of management and administrative tasks relating to database function that were once completed manually. Modern databases can generate vast numbers of diverse streams of sensory data with important implications for database behavior and performance. Such sensory data range from expanded performance metrics to internal and external resource utilization to workload statistics and detailed process logs. Given that traditional hands-on inspection procedures are unable to match the expansive volume of this data, there now exists a unique opportunity to leverage this data and the application of machine learning techniques






Speakers
avatar for Yaser Suleiman

Yaser Suleiman

CMTS, Oracle
Yaser Suleiman, CMTS at Oracle HQ, Database RAC organization, developer in CHA team (part of the Autonomous Health).


Thursday March 14, 2019 10:55am - 11:45am
3-Rm 104

1:30pm

Four awarded Brazilian use cases using Oracle Big Data, ML and Analytics
In this session Marcelo Bellorio, enterprise solutions architect for AI, Big Data and Analytics in Oracle Brazil, will drill down on the Architectures, Business Outcomes and what`s next for the four Oracle Open World 2018-awarded Brazilian companies use cases:
1 - United Health Group Brazil (Healthcare): Oracle Global Leaders 2018 Architect Leader of the Year Excellence Award for Latin America for the Healthcare and Legal cases on top of Oracle's Big Data architecture
2 - Nexel Brazil (Telco): OOW 2018 Oracle Excellence Award for Data Management Innovation AND Oracle Global Leaders 2018 DW & Big Data Leader of the Year Excellence Award for the Telco Machine Learning use cases on top of the Oracle's Big Data, ADW and Analytics Platforms
3 - Oi (Telco): OOW 2018 Oracle Excellence Award for Analytics Innovation for the Analytical Spatial's ML case on top of Oracle's DW and Analytics Platform
4 - GEQ (Gas & Consumer Goods): OOW 2018 excellence Award Finalist for the ML Tax Intelligence case

Speakers
avatar for Marcelo Bellorio

Marcelo Bellorio

Enterprise Solutions Architect, Oracle
I am a passionate individual about working within diverse and multicultural teams, strong believer in the value of people, data and technology within the organizations for business development and transformation. With experience in both the computer science and business fields, I... Read More →


Thursday March 14, 2019 1:30pm - 2:20pm
5-Rm 202 HOL

1:30pm

Oracle Cloud Infrastructure provides a Modern Infrastructure for Analytics
Do you want more agility for your AI and ML workloads? Oracle Cloud Infrastructure is built from the ground up for enterprise and next generation workloads. In this session, learn about infrastructure fundamentals and understand how to effectively deploy in cloud for improved agility. Hear about the latest services and offerings that enable analytics and help speed decision making. Get a demo on our GPU offerings and how that can be applied to analytics, learn about how storage can enable analytics and how the right network infrastructure can speed workload execution. Hear about what is coming next.


Speakers
avatar for Ed Beauvais

Ed Beauvais

Director Product Management, Oracle, Oracle Cloud Infrastructure
Ed Beauvais is Director of Product Management for Oracle Cloud Infrastructure. Ed's roles have spanned Product Management, Technology Strategy and Product Marketing. Ed is a strategic innovator with progressive experience in successfully directing new product initiatives, 1.0 product... Read More →


Thursday March 14, 2019 1:30pm - 2:20pm
1-Rm 102 350 Oracle Parkway, Redwood City, CA, United States

2:30pm

Building High performance and Highly available Hadoop cluster. Practical lessons
In my current position in Oracle I'm working with many customers all around the world and help them to be successful with their Big Data projects. So, I'm kind of bridge between development and fields, bring and create best practices from Development to the field and bring feedback from the fields to Development for adjust roadmap of Oracle Big Data products. After accumulating all practical experience and gather feedback from customers we found really good default configs for Hadoop Clusters, which could fin for 90% of the customers for their needs.
It include 5 pillars:
- YARN performance tuning
- Spark performance tuning
- Set up Static Service pools
- Set up Dynamic Service pools
- Setup Hadoop cluster in Highly available way
In my session, I talk about key aspects of well configured Hadoop clusters.


Speakers
avatar for Alexey Filanovskiy

Alexey Filanovskiy

Product Manager, Oracle
I'm Alexey Filanovskiy - part of the development organization in Oracle. I use to be Oracle Developer, but since 2010 Hadoop world captivated me with his head. Since 2012 I'm working with Big Data technologies in Oracle. I've started my carrier in pre-sales organization and in 2015... Read More →


Thursday March 14, 2019 2:30pm - 3:20pm
1-Rm 102 350 Oracle Parkway, Redwood City, CA, United States

3:40pm

Location-Based Tracking of Moving Objects with Apache Spark
Tracking K nearest neighbors from a large dataset to a given location is an expensive task. This session shows how to use Oracle Big Data Spatial and Graph’s Vector API to build a real-time Spark streaming application that constantly calculates the K nearest moving vehicles for every element from a set of static stations, based on the distance and user-defined business rules. The vehicle location data is continuously received from a stream that sends updated vehicle locations every few seconds so the nearest vehicles for each station are constantly updated and written to HDFS. This presentation also shows how to build a map visualization application to view this real-time tracking with the Oracle Maps API.

Speakers
avatar for Siva Ravada

Siva Ravada

Senior Director of Development, Oracle, USA
Siva manages the development team for Oracle’s spatial technologies. He is one of the founding members of the development of Spatial technologies in Oracle since the Oracle7 release. He holds numerous patents, has several journal and conference publications, and authored a book... Read More →


Thursday March 14, 2019 3:40pm - 4:30pm
2-Rm 103