Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Generate data with correlations #3

Open
jknowles opened this issue May 5, 2015 · 6 comments
Open

Generate data with correlations #3

jknowles opened this issue May 5, 2015 · 6 comments

Comments

@jknowles
Copy link

jknowles commented May 5, 2015

I started work awhile ago on a much less ambitious project than wakefield to attempt to generate random data sets on the fly with a known correlation structure. You can see the seeds of that work here: https://github.com/jknowles/datasynthR

It would be cool to include the ability to generate numeric or factor data with a known correlation structure to build structural relationships into the very realistic looking data generated by wakefield.

@trinker
Copy link
Owner

trinker commented May 9, 2015

It seems that you've done a lot of work on this already. This is pretty nice. After looking at what you have, replicating what you have is needlessly redundant.

Is there a way you could continue to develop datasynthR with the end goal ability to incorporate functionality into wakefield or as a stand alone package. Do you plan to make this a CRAN package? I'd like to see a relationship between the two packages in the way magrittr and dplyr have.

@trinker
Copy link
Owner

trinker commented May 9, 2015

Note to future self...

Depending on @jknowles response I may want to import (add to Depends:) and make a wrapper for his package. Maybe named r_distribution_cor that works similar to r_sereies.

@jknowles
Copy link
Author

@trinker I'm interested in this. I have run into a few snags with datasynthR that caused me to delay working on it while I moved on to other problems. But, I could probably return to it this summer and get a CRAN worthy version released soon enough. I'd want to check in with you about how to make the packages complementary. wakefield really solves one of those problems that I was having with datasynthR that the data generated didn't feel real enough for users who cared about more than the structure (plotting, etc.).

@trinker
Copy link
Owner

trinker commented Sep 3, 2015

@jknowles Any progress on datasynthR?

@Black-Milk
Copy link

Black-Milk commented Mar 16, 2017

@trinker Any news on this?

@jknowles
Copy link
Author

I've been revisiting datasynthR recently with a project for a client (and also exploring how wakefield works internally in the process). I imagine datasynthR will need to be refactored soon. I can't guarantee any time to be devoted to that in the coming months -- it depends on whether current projects necessitate it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants