The dataset is publicly available under an open license at https://huggingface.co/datasets/nvidia/OpenCodeInstruct.