Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

question about the biggest N #37

Open
stevenlis opened this issue Jan 20, 2020 · 7 comments
Open

question about the biggest N #37

stevenlis opened this issue Jan 20, 2020 · 7 comments

Comments

@stevenlis
Copy link

Thanks for the package! I'm trying to use it in my study with a sample size around 26,000 (10 covariates). However, in the following paper:

Messy Data, Robust Inference? Navigating Obstacles to Inference with bigKRLS:

bigKRLS can handle datasets up to approximately N = 14,000 on a personal machine before
reaching the 8 GB cutoff

Thus, I'm concerned about whether I should continue. Does this mean the program will stop running if I fit a dataset with N > 14,000? I have a laptop with 16 GB RAM. Will it be OK?

@rdrr1990
Copy link
Owner

rdrr1990 commented Jan 21, 2020 via email

@stevenlis
Copy link
Author

I tried with N = 5,000, and save.bigKRLS() generated a folder of files that takes 1.5 GB.
Assume my total N = 25,000, then I would need an 1.5*(25000/5000)^2 = 37.5 GB RAM to run the model right?

@rdrr1990
Copy link
Owner

rdrr1990 commented Jan 21, 2020 via email

@stevenlis
Copy link
Author

When I run the whole model with N around 26000. I got the following error. Am I missing something? (sorry for the low-fi image)
IMG_4865

@stevenlis
Copy link
Author

@rdrr1990 any hit?

@rbshaffer
Copy link
Collaborator

rbshaffer commented Jan 26, 2020 via email

@stevenlis
Copy link
Author

Hi @rbshaffer. Thanks for the reply. I thought it was due to my sample size.

I've tried with a sample of my dataset with more than N = 13,000, which had no issue at all. I will try it again and see if there is anyway I can share the dataset.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants