Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Crashing RStudio #6

Open
systemnova opened this issue May 20, 2021 · 3 comments
Open

Crashing RStudio #6

systemnova opened this issue May 20, 2021 · 3 comments

Comments

@systemnova
Copy link

When running the following, Rstudio Crashes:
p_load(outliertree, lubridate)
df_temp <- df %>% select(!where(is.Date)) #%>% outlier.tree(nthreads=1)
df_temp <-sample_n(df_temp,10000)
otree <- outlier.tree(df_temp, max_depth = 0)

If i random sample down to 2000, it works. So it seems like a validation issue or bad data issue somewhere. No other function has crashed when running on the whole dataset, so it appears to be an issue with outliertree.

Unfortunately I'm unable to share the dataset, but there are known column shift issues in the data set from commas in the CSV being in unexpected places and strings appearing in mostly numeric columns. Appologies that i cant be more specific, hopefully of some benefit. Because Rstudio hard crashes I'm unable to produce any other output (eg. warnings())

@david-cortes
Copy link
Owner

david-cortes commented May 20, 2021

Thanks for the bug report. A couple questions:

  • Does it crash while building the model, or after it has already been built and is being used? If the latter, it should be solved in version 1.7.2 (currently in CRAN).
  • What types of columns does the data contain? How many?
  • Does it crash if you set outliers_print=0?
  • Do you see some error message? E.g. something looking like this: std::bad_alloc()

@systemnova
Copy link
Author

systemnova commented May 20, 2021

Does it crash while building the model, or after it has already been built and is being used? If the latter, it should be solved in version 1.7.2 (currently in CRAN).

It crashes in the first few seconds of running outlier.tree
I'm using version 1.7.1 and unfortunately cant use compile binaries or use remotes to install the newer version.

What types of columns does the data contain? How many?

40 columns, 33 character, 7 integer i've just inspected a little more closely and noticed some POSIXct variables still left after the date filter. I've added a POSIXct selection now, but it still crashes when running on the whole dataset. I'm wondering if it's a POSIXct/Date value in a column where it shouldnt be thats causing the issue.

Does it crash if you set outliers_print=0?

Yes, still crashes

Do you see some error message? E.g. something looking like this: std::bad_alloc()

No, it's a hard crash with a popup dialogue saying "R Session Aborted - R encountered a fatal error. The session was terminated" no output additional output is visible in the console.

@david-cortes
Copy link
Owner

Couple more questions:

  • What operating system is this? Was the package installed from CRAN binaries or from source? If from source, did it by any chance use clang?
  • Would it still crash if you set nthreads=1?
  • Have you by any chance perhaps updated Rcpp after having installed outliertree?
  • Are you passing a data.frame from base R, or is it some derivative like data.table or tibble?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants