Protege Media

Protege Media

Turning great content into great technology

Turning great content into great technology

Turning great content into great technology

Calliope Networks is now Protege Media.

Calliope has been featured in Forbes, Variety, Wired, and more.

Audiovisual content, ranging from movies and TV to news and sports, is incredibly valuable as AI training data. However, AI companies face many challenges when sourcing media training data, which Protege solves:

Data fragmentation

Often, the data AI developers need comes from many different sources, making it cumbersome to collect it in a resource-effective way.

Copyright and licensing issues

Negotiating separate licensing deals with potentially hundreds or thousands of providers is not scalable. And ignoring copyright issues altogether creates significant legal risks.

Usefulness for AI

In many cases, certain ethnicities, locations, languages, and topics are underrepresented in media training data, leading to skewed outputs from models that train on it. In other cases, media data comes in highly inconsistent formats, leading to significant preprocessing costs.

Data fragmentation

Often, the data AI developers need comes from many different sources, making it cumbersome to aggregate efficiently.

Protege Solution

Protege does the hard work of scouring the globe for diverse film, television, news and sports. Our media team has decades of content licensing experience and relationships on every continent.  

Copyright and licensing issues

Negotiating separate licensing deals with potentially hundreds or thousands of providers is not scalable. And ignoring copyright issues altogether creates significant legal risks.

Protege Solution

Our data is ethically sourced, copyright compliant and fully licensable for AI model training.

Usefulness for AI

In many cases, certain demographics, languages, and topics are underrepresented in media data, leading to skewed outputs from models that train on it. In other cases, media data comes in highly inconsistent formats, leading to significant preprocessing costs.

Protege Solution

We have specifically curated our dataset to represent a diverse range of locations, ethnicities, languages, objects, animals, topics, and more, making it perfect for training AI models.


Our industry-leading collection of media training data has the following advantages:

Protege solves all three of these problems by:


  • Doing the hard work of scouring the globe for diverse film, television, news and sports. Our media team has decades of content licensing experience and relationships on every continent.  

  • Ensuring that our data is ethically sourced, copyright compliant and fully licensable for AI model training.

  • Curating our dataset to represent a diverse range of locations, ethnicities, languages, objects, animals, topics, and more, making it perfect for training AI models.


Our industry-leading collection of media training data has the following advantages:

TV and film catalog with 150,000+ hours of content

Our catalog includes a broad selection of 4K and 3D content

Diversity of locations, people, objects, etc. make this perfect for AI training

We’ve built relationships with some of the largest tech companies in the world, many of which are building video or multimodal models and are very interested in media content. We are also building relationships with a wide range of AI companies that are specifically focused on media-related use cases.

If you own high-quality audiovisual content and want to explore this new incremental revenue stream, or if you need audiovisual datasets to train your models, we'd love to help. Contact us below to get started.