Dataset
Tens of millions of Midjourney prompt and image datasets to training and fine-tune your image generation models!
Empowering your model training & fine-tuning
20 million data entries collected (as of February 2024), and we are adding about 2 million data entries per month!
Data entries sorted by model versions, styles, mediums, compositions, and could further be customized to contain specific keywords in prompts.
Flexible payment plans for one-time dataset purchase or long-term dataset provision. Delivery methods include download URLs or mailed hard drives.
Dataset statistics
Delivery Methods
- The client to create a cloud storage
- Dataset will be uploaded onto client’s cloud storage.
- Expect long file transfer upload time for large datasets.(Testing can be done to estimate upload time)
- Fast delivery time! Pre-loaded hard drives ready for shipping!
- Worldwide shipping available, let us know your location!
- Hard Drive cost: $20/TB (6 TB = ~ 1 million imagine entries)
Our Pricing
Option 1: No Custom Filtering Logic
Option 2: With Custom Filtering Logic
Custom Logic Filtering Steps:
- We will send to you a file with 2 million metadata texts (including types of operation, model version, prompts, data entry ID, for example see below).
- You will write the filtering script yourself to filter for the desired data entries.
- You send the working scripts back to us.
- We run the script on all existing data entries and will deliver the result.
- If you require further data cleaning services (ex.png to webp conversion, splitting images, etc), we’d charge 10% more.
Option 3: Custom Dataset Curation!
Curation Steps:
- You send us the custom imagine prompts you want to run in Midjourney
- We will run these prompts in Midjourney and return back the picture links of the generated results for you to download!
Delivery and Content:
- The unit pricings mentioned above are for downloadable URLs delivery method only. Cost of hard drives for physical hard drive delivery method is not included.
- One data entry includes: 1 JSON file and the corresponding image results, for example:
1.One JSON file (which should include the contents below)
2.Corresponding Results
Frequently Asked Questions
Midjourney is an independent research team providing state-of-the-art text-to-image generation model to the public through Discord server where users interact with its Midjourney bot. Users can send a query in natural language (i.e. a "prompt"), then the Midjourney bot will return four high-quality images and offers further options like upscaling or re-generating a variation of the original images.
© Powered by MidjourneyDataset.
All rights reserved.