RealCustom: Achieve Real-Time Text-to-Image Customization with Unparalleled Control
Want to generate custom images from text that perfectly match your vision? Discover RealCustom, a revolutionary approach to text-to-image customization that delivers unmatched subject fidelity and textual control in real-time. See how it works and why it's a game-changer!
The Challenge: Balancing Fidelity and Control in Text-to-Image Generation
Existing subject-driven image generation methods often struggle to reconcile visual and textual inputs. This creates a frustrating trade-off:
- Subject Fidelity: Maintaining the likeness of the subject matter.
- Textual Controllability: Accurately reflecting the prompt's instructions.
RealCustom solves this problem by disentangling these two key features, enabling simultaneous optimization of both.
Introducing RealCustom: The Real-World Solution for Image Customization
RealCustom bridges the gap between text and visuals by:
- Representing Subjects as Real Words: It treats subjects as interchangeable real words, seamlessly integrating them into text prompts.
- Disentangling Influences: It separates the visual aspects from textual instructions, giving you precise control.
How RealCustom Works: A Step-by-Step Guide
Ready to create stunning personalized images? RealCustom simplifies the process into two easy steps:
Step 1: Character Creation - Define Your Subject (Character)
- Character Image: This is your reference image. Ideally, use a high-quality, close-up image with a simple, distraction-free background. The subject should be clear and prominent.
- Character Description: Create a brief description that highlights the subject and its key features.
Step 2: Character-Driven Image Generation - Unleash Your Creativity
- Input Prompt: Craft your desired image prompt, but instead of directly mentioning the subject, use the "character" you defined in Step 1.
- Fine-Tune: Use "Face Reference Strength" and "Body Reference Strength" to fine-tune the visual fidelity and likeness of your subject.
Benefits of Using RealCustom
- Simultaneous Optimization: Achieve high subject fidelity and precise textual control at the same time.
- Real-Time Performance: Experience quick image generation turn around.
- Enhanced Customization: Enjoy advanced algorithms previously accessible only via commercial applications like Dreamina and Doubao.
- Simplified Workflow: Get started fast with an intuitive and user-friendly process.
Get Started with RealCustom Today!
Ready to experience the future of text-to-image customization? Here's how to get started:
Installation
- Install Requirements: Refer to the project's README for a list of dependencies.
- Download Models: Download the necessary pre-trained models from Hugging Face and place them in the designated "ckpts/" directory.
Inference
- Run the provided inference script:
bash inference/inference_single_image.sh
. - Explore the Gradio demo for an interactive experience.
- Try RealCustom on Dreamina for an elevated experience!
New Customization Framework: UNO
Check out the latest advancement! The UNO customization framework enhances RealCustom, offering even greater control and fidelity.
Citation
If RealCustom accelerates your research, cite the relevant papers:
@inproceedings { huang2024realcustom,
title = { RealCustom: narrowing real text word for real-time open-domain text-to-image customization},
author = { Huang, Mengqi and Mao, Zhendong and Liu, Mingcong and He, Qian and Zhang, Yongdong},
booktitle = { Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
pages = { 7476--7485},
year = { 2024}
}
@article { mao2024realcustom++,
title = { Realcustom++: Representing images as real-word for real-time customization},
author = { Mao, Zhendong and Huang, Mengqi and Ding, Fei and Liu, Mingcong and He, Qian and Zhang, Yongdong},
journal = { arXiv preprint arXiv:2408.09744},
year = { 2024}
}
@article { wu2025less,
title = { Less-to-More Generalization: Unlocking More Controllability by In-Context Generation},
author = { Wu, Shaojin and Huang, Mengqi and Wu, Wenxu and Cheng, Yufeng and Ding, Fei and He, Qian},
journal = { arXiv preprint arXiv:2504.02160},
year = { 2025}
}
RealCustom: Where Innovation Meets Imagination.
Unlock the power of RealCustom and bring your creative visions to life with unprecedented control and detail. Real-time text-to-image customization is now within your reach!