Skip to content

tverbeiren/dataMineR

 
 

Repository files navigation

dataMineR

Update 7/6/2013

In step 1, we now render tables in extended (Pandoc) markdown. This means that Pandoc can be used to output either pdf or html in a well-formatted way.

Update

We are now transforming from latex to markdown

R scripts for datamining.

This project aims to efficiently implement the CRISP datamining cycle. Initially we focus on supervised models for dichotomous class predictive models.

The project tries to combine R scripts with knitr in R-studio in such a way that we walk through the CRISP phases and deliver a scoring model in the most efficient way. In the meantime we would like to document our steps in a report that is nice enough to be used a reference of the work been done.

From wikipedia: CRISP-DM breaks the process of data mining into six major phases:

  • Business Understanding
  • Data Understanding
  • Data Preparation
  • Modelling
  • Evaluation
  • Deployment

In this project we focus on automating the steps of Data Understanding, Data Preparation, Modelling, Evaluation.

About

R scripts for datamining. Goal is to create R scripts that combine good datamining methods and knitr to generate datamining result reports based on the CRISP datamining methodology

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • CSS 56.5%
  • JavaScript 22.9%
  • R 19.7%
  • Shell 0.9%