Skip to main content

FAQ

  • FAQ

    • How quickly can I get started using the dataset?

      Check out the Colab tutorials for a walkthrough of the data format (Colab for Perception Dataset, Colab for Motion Dataset). Just open the tutorial notebook in Colab. Please note that the tutorial currently uses some sample frames - it does not access the actual dataset files.

      The Quick Start can help you get the codebase installed on your machine locally.

    • Where can I learn the basics of Colab?

      You can learn the basics of Colab with this Welcome notebook.

    • Where can I learn the basics of TensorFlow?

      Google’s Machine Learning Crash Course is the best place to start.

    • Where can I find the metrics code?

      Check out the metrics directory in the GitHub repo.

    • Where can I find the Perception Dataset's labeling specification?

      You can find it in the GitHub repo here.

    • Is there a paper with more details on the composition of the dataset?

      Yes! You can find it on arXiv here. The paper on the Motion Dataset is also on arXiv here.

    • Is this data being offered under an open source license?

      The Dataset license has certain limitations around distribution and should not be considered an open source license. If you are interested in a license with different terms, please contact us at open-dataset@waymo.com.

    • What are some examples of acceptable and unacceptable uses of the dataset under its license?

      Example 1, Acceptable Use:

      You are a researcher at a university. You use the dataset to perform benchmarking on algorithms you’ve developed. You publish the results. Your published paper includes small extracts of data taken from the dataset for purposes of illustration. Your paper provides the appropriate attribution to the Waymo Open Dataset.

      Example 2, Acceptable Use:

      You are a researcher at a technology company. You experiment on the dataset using internal systems that are not used to provide a product or service to customers. You develop algorithms, model definitions, and training code as a result of those experiments, and check them into those internal systems with the appropriate attribution to the Waymo Open Dataset. You submit those algorithms and model definitions to a conference for public review. You include the appropriate attribution to the Waymo Open Dataset. Your submission is accepted. You release the training code to let others replicate your results.

      You are a researcher at another company. You find the publication interesting. You download the dataset from Waymo and train the published model definition against the dataset to see if you can confirm the results.

      Example 3, Unacceptable Use:

      You are an engineer at an autonomous vehicle company. You use the dataset to train a prototype object detection model for use on a vehicle at a test track. You use this trained model as a placeholder until you build a large enough internal dataset to train your model against.

      Example 4, Unacceptable Use:

      You train and fine-tune existing models based on the dataset. You use weights and biases from that model and deploy them in a system you intend to use for current or future customers.

    • Why can’t I publish my trained model?

      We want the dataset to be used for Non-Commercial Purposes only, but we want open sharing within the community of registered researchers, so we put in mechanisms to encourage people to join.

      First, there are many things that researchers can do with the dataset. For example, researchers are welcome to use the data and publish their findings. Researchers can publish algorithms, model definitions and training code developed using the dataset. Other researchers can replicate the results by downloading the dataset from Waymo and using the dataset for training.

      Second, we want to encourage transparency in how models are trained, so we encourage researchers to provide the training instructions with which others can replicate the results by downloading the dataset from Waymo and using the dataset for training.

      Third, although you cannot publish trained models, you can share the trained model you develop within the community of registered researchers.

    • Can I use the dataset to create and submit benchmark results for publication on MLPerf.org?

      Yes. We consider benchmarking for the purpose of publishing on MLPerf.org to be a non-commercial use.

    • Is republishing my official MLPerf.org scores or publishing unverified MLPerf benchmark scores elsewhere, like my blog or marketing materials, an acceptable use of the Waymo Open dataset?

      If you used the Waymo Open Dataset to create and submit benchmark results for publication on MLPerf.org, republishing the results in other locations is acceptable. If you used the Waymo Open Dataset to create unverified benchmark results consistent with the MLPerf.org rules, publishing the results in other locations is acceptable.
      Please adhere to the trademark usage guidelines and terms of use required by MLPerf.org and Waymo.

      Example formats:

      achieved a score of <0.X> on the MLPerf with the , trained on the Waymo Open Dataset [1].

      [1] MLPerf v1.0 Training Automotive Object Detection. Retrieved from mlperf.org 21 December 2021, entry 1.0-134. MLPerf name and logo are trademarks. See mlperf.org for more information. Waymo Open is a trademark. See waymo.com/open for more information. No endorsement or affiliation is implied.

      or

      announced an unverified score of <0.X> on the MLPerf with the , trained on the Waymo Open Dataset [1].

      [1] MLPerf v1.0 Training Automotive Object Detection; Result not verified by MLPerf. MLPerf name and logo are trademarks. See mlperf.org for more information. Waymo Open is a trademark. See waymo.com/open for more information. No endorsement or affiliation is implied.

    • What are you doing to ensure the privacy of people in the images?

      The images in the Waymo Open Dataset blur any license plates and faces.

    • How can I remove my entry from the challenge leaderboards?

      If you would like to remove your entry from the leaderboard, please reach out to open-dataset@waymo.com.

    • How can I get Google Cloud credits to work with the dataset?

      Faculty and PhD students can apply for Google Cloud credits.

    • Who should I contact with questions, thoughts, and suggestions on the Open Dataset?

      Please reach out to open-dataset@waymo.com.