R for Predictive Modeling: A Hands-On Introduction

June 22, 2017 @ 9:00 am - 4:30 pm

This one-day session provides a hands-on introduction to R, the well-known open-source platform for data analysis. Real examples methodically expose attendees to best practices for using R and its rich set of predictive modeling packages, building practical experience and know-how. R is compared to other data analysis platforms, and common pitfalls in using R are addressed.

The instructor, a leading R developer and the creator of CARET, a core R package that streamlines the process of creating predictive models, will guide attendees through hands-on work with R, covering:

  • A working knowledge of the R system
  • The strengths and limitations of the R language
  • Preparing data with R, including splitting, resampling and variable creation
  • Developing predictive models with R, including decision trees, support vector machines and ensemble methods
  • Visualization: Exploratory Data Analysis (EDA), and tools that persuade
  • Evaluating predictive models, including lift curves, variable importance and how to avoid overfitting

Each participant will receive a copy of Max’s book Applied Predictive Modeling.

Hardware: Bring Your Own Laptop
Each workshop participant is required to bring their own laptop running Windows or OS X. The software used during this training program, R, is free and readily available for download.

Attendees receive an electronic copy of the course materials and related R code at the conclusion of the workshop.

Intended Audience: Practitioners who wish to learn how to carry out predictive analytics using the R language; anyone who wants “to turn ideas into software, quickly and faithfully.”

Knowledge Level: Either hands-on experience with predictive modeling (without R) or hands-on familiarity with any programming language (other than R) is sufficient background and preparation to participate in this workshop. The three-hour “R Bootcamp” is recommended preparation for this workshop.

Schedule:
Workshop starts: 9:00am
Morning Coffee Break: 10:30am – 11:00am
Lunch (provided): 12:30pm – 1:15pm
Afternoon Coffee Break: 2:30pm – 3:00pm
End of the Workshop: 4:30pm

Partner Event –

You must register to attend –

Apex and DataTorrent RTS are being actively used in use cases relating to Big Data, Cloud, IoT, cybersecurity, real-time anomaly detection, etc. This conference will help us understand how Apex can be used in these use cases.

To reduce time to market and total cost of ownership, look at the operable solutions factory, which offers templates that you can quickly import and launch. Examples: HDFS to HDFS & HDFS-Line-Copy (back-up, replication, disaster-recovery, distcp replacement); Kafka to HDFS (ingest, transform); S3 to HDFS (cloud to on-prem); HDFS to Kafka (data lake to event stream, big data log streaming); Database to HDFS (db offload); Database to Database (change data capture, customer 360); Kafka to Database (ingest, transform & load); Kinesis to S3 (cloud ingest, transform & load).

Templates include the ability to parse, error check, transform, and act on data before loading. Additionally, you can add or modify your own custom logic for transforms, alerts, and actions. Templates include real-time dashboarding for instant and historical views.

Free DataTorrent Enterprise Edition for qualifying startups. Check it out!

Free DataTorrent Enterprise Edition for Universities. Check it out!

Brought to you by DataTorrent, creators of Apache Apex.




Hilton Chicago
720 South Michigan Avenue
Chicago, IL 60605, US


Next Gen Native Hadoop Big Data Apex Users Group, Chicago