+ Reply to Thread
Results 1 to 3 of 3

Thread: Bookclub: Data Mining with R

  1. #1
    Probably A Mammal
    Points: 14,507, Level: 78
    Level completed: 15%, Points required for next Level: 343
    bryangoodrich's Avatar
    Location
    Sacramento, California, United States
    Posts
    1,956
    Thanks
    223
    Thanked 419 Times in 387 Posts

    Bookclub: Data Mining with R



    Here I want to begin a book club for talking about Data Mining with R. I know a few of us here at TS have in our possession or have looked at this book. This format for the club is on-going. You participate at-will. If you don't have the book, check it out this summer and participate! The point is to have a repository of information, questions, and discussion on the contents. These may be theoretical, to which anyone can answer, or they may be specific, to which only those with the book may be supportive. In any case, I hope this minimal format will produce more participation than we've seen in the past (you slackers!).

    You can get the data from the website linked above, but better yet, just use their package

    Code: 
    install.packages("DMwR")
    This gives you the data sets

    Code: 
    algae                   Training data for predicting algae blooms
    test.algae (testAlgae)  Testing data for predicting algae blooms
    algae.sols (algaeSols)  The solutions for the test data set for
                            predicting algae blooms
    GSPC                    A set of daily quotes for SP500
    sales                   A data set with sale transaction reports
    This covers the main 3 cases, but not the last (microarray samples). Instead, you have to run this once

    Code: 
    source("http://bioconductor.org/biocLite.R")
    biocLite()
    biocLite("ALL")
    Then you can access the data

    Code: 
    library(Biobase)
    library(ALL)
    data(ALL)

  2. #2
    Bhoot
    Points: 1,270, Level: 19
    Level completed: 70%, Points required for next Level: 30

    Posts
    1,758
    Thanks
    40
    Thanked 124 Times in 106 Posts

    Re: Bookclub: Data Mining with R

    I am in. I could do something productive in the Finance case study.
    In the long run, we're all dead.

  3. #3
    Points: 113, Level: 2
    Level completed: 26%, Points required for next Level: 37

    Posts
    1
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Re: Bookclub: Data Mining with R


    Hi

    Have any of you worked through chapter 3 - Predicting Stock Markets?

    It's a really good introduction to many useful R functions for predicting and testing. However, the final section leaves you a bit lost. Have any of you worked out how to obtain the predicted signal for today's (or most recent) data point? Would like to hear from you.

    Regards,

    Laurits

+ Reply to Thread

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts








Advertise on Talk Stats