This R package provides functions for working with data released by the NIH Brain Development Cohorts (NBDC) Data Hub, which currently offers data for the ABCD and HBCD studies.
The NBDCtools package is designed to help researchers and data analysts create analysis-ready datasets from the tabulated data released by the ABCD and HBCD studies. Its core functionality is to join selected variables and/or entire tables from the tabulated data files into a single data frame in memory. Additionally, the package offers several functions to assist users in working with the data, including:
- Transformation functions to convert categorical columns to (ordered or unordered) factors based on the data dictionary and levels table, or to add variable and value labels to a dataset.
- Filter/subsetting functions to filter by a set of participant/events, filter ABCD events using shorthands, or exclude rows or columns with only missing data.
- Functions to retrieve and utilize metadata from the studies.
- A set of utility functions for various use cases.
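To make the core idea concrete, the following is a minimal base R sketch of joining variables from several tables and converting a categorical column to a factor via a levels table. It does not use the NBDCtools API; the key columns (`participant_id`, `session_id`), the toy tables, and the levels table layout are all assumptions made for illustration only.

```r
# Toy stand-ins for two tabulated files; all names and values are hypothetical.
demo <- data.frame(
  participant_id = c("P01", "P01", "P02"),
  session_id     = c("ses-00", "ses-01", "ses-00"),
  sex            = c("1", "1", "2"),
  stringsAsFactors = FALSE
)
phys <- data.frame(
  participant_id = c("P01", "P02", "P02"),
  session_id     = c("ses-00", "ses-00", "ses-01"),
  height_cm      = c(140.2, 138.5, 144.0),
  stringsAsFactors = FALSE
)

# Hypothetical levels table mapping coded values to labels.
levels_tbl <- data.frame(
  name  = c("sex", "sex"),
  value = c("1", "2"),
  label = c("Male", "Female"),
  stringsAsFactors = FALSE
)

# Full join of all tables on the assumed participant/event keys.
keys <- c("participant_id", "session_id")
joined <- Reduce(
  function(x, y) merge(x, y, by = keys, all = TRUE),
  list(demo, phys)
)

# Convert the coded column to an unordered factor using the levels table.
sex_levels <- levels_tbl[levels_tbl$name == "sex", ]
joined$sex <- factor(
  joined$sex,
  levels = sex_levels$value,
  labels = sex_levels$label
)

print(joined)
```

A full join keeps every participant/event combination that appears in any table, so rows that are absent from one table show up with missing values in that table's columns.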
IMPORTANT: Besides the main NBDCtools package, users also need to install the accompanying NBDCtoolsData package, which contains the large data dictionary and levels tables used by the NBDCtools package.
To install the latest version of the packages from GitHub, use the following command:
if (!requireNamespace("remotes", quietly = TRUE)) {
  install.packages("remotes")
}
remotes::install_github("nbdc-datahub/NBDCtools")
remotes::install_github("nbdc-datahub/NBDCtoolsData")
There are some additional dependencies that are not installed by default, as they are only needed for specific features, such as processing shadow matrices. To install all the dependencies, use the following command:
remotes::install_github("nbdc-datahub/NBDCtools", dependencies = TRUE)
To install a specific version of the package, specify the version number in the remotes::install_github() call. For example, to install version 1.0.0, use the following commands:
remotes::install_github("nbdc-datahub/NBDCtools@v1.0.0")
remotes::install_github("nbdc-datahub/NBDCtoolsData@v1.0.0")
NOTE: The NBDCtoolsData package version does not match the NBDCtools package version, i.e., the two packages follow different release cycles. For example, the NBDCtools package may remain unchanged across ABCD/HBCD data releases, but the NBDCtoolsData package is updated with each data release. Therefore, it is recommended to always update the data package to the latest version.
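Because the two packages are versioned independently, it can help to check which versions are currently installed before deciding whether to update. The snippet below is a small sketch using only base R and utils; `installed_version()` is a hypothetical helper defined here, not a function provided by NBDCtools.

```r
# Hypothetical helper: return the installed version of a package as a string,
# or NA if the package is not installed.
installed_version <- function(pkg) {
  if (!requireNamespace(pkg, quietly = TRUE)) {
    return(NA_character_)
  }
  as.character(utils::packageVersion(pkg))
}

# Report the versions of both packages side by side.
versions <- vapply(
  c("NBDCtools", "NBDCtoolsData"),
  installed_version,
  character(1)
)
print(versions)
```

If the data package is reported as missing or outdated, re-running the remotes::install_github("nbdc-datahub/NBDCtoolsData") command shown above installs the latest version.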
Optional dependencies can also be installed individually as needed, rather than all at once. For example, processing shadow matrices requires the naniar package, which can be installed with the following command:
# for shadow_bind_data function
install.packages("naniar")
For a general overview of how to download data from the NBDC Data Hub and use this R package to create custom datasets, see the Get Started page. To browse the documentation of all available functions, see the Reference page. For more detailed information about different categories of functions, refer to the vignettes.
If you encounter issues while using this package, please report them by submitting a GitHub issue. Please do not submit any issues related to the ABCD or HBCD data resources themselves (see here for information on how to report issues with the data resource).
We are currently preparing a paper about NBDCtools. The reference will be added here once the paper is published.