How can we help?

Search

Does your training data include any data protected by copyright, trademark, or patent, or is it entirely in the public domain?

March 31, 2026 09:33
Updated

The publicly available and open-source datasets that we have used in some instances for model training are datasets that are used across industry for training language models and may include some protected text. We also use text that has been made available under Creative Commons licenses as well as text that is in the public domain.

Comments

0 comments

Please sign in to leave a comment.

How can we help?

Search

Related articles