Loading…
Thanks for a great Analytics and Data Summit 2019 event. We hope to see you next year Feb 25-27, 2020!
To download the presentations you must first sign up with Sched for free (easy and fast!)
  • Sessions appear in the color of their primary track and can be filtered using Products on the right
  • Use the Search bar for more flexibility
See this link for hints on how to search the schedule


Thursday, March 14 • 9:50am - 10:40am
Cross-Validation & Model Selection in ML Classification: ORAAH and OAA examples

Sign up or log in to save this to your schedule and see who's attending!

Feedback form is now closed.
Cross-Validation is a great method to better estimate the expected quality of a model on future Scoring Datasets, and also to compare performance between models. From a Confusion Matrix generated for each model run, several useful statistics can be extracted, including Precision, Accuracy, Recall, True Positive Rate, F1 Score, Informedness, and others. In this session we will see an example of using k-Fold Cross-Validation to choose the best Model for several Examples, using the latest Oracle R Advanced Analytics for Hadoop (ORAAH) and Oracle Advanced Analytics (OAA). The sample R and SQL code will be explained, but familiarity with R or SQL is not crucial.

Speakers
avatar for Marcos Arancibia

Marcos Arancibia

Product Manager, Data Science and Big Data, Oracle
Marcos Arancibia is the Product Mgr for Oracle Data Science and Big Data, working with Machine Learning in the Oracle Database and on Hadoop/Spark clusters. He is charted with developing a comprehensive platform for enabling a new class of analytical workloads encompassing techniques... Read More →


Thursday March 14, 2019 9:50am - 10:40am
1-Rm 102 350 Oracle Parkway, Redwood City, CA, United States