Partnering with Generative AI: Experimental Evaluation of Human-Led and Model-Led Interaction in Human-AI Co-Creation
Sebastian Maier, Manuel Schneider, Stefan Feuerriegel
Human-led modes (suggestions and questions) produced significantly higher-quality creative outputs than model-led generation. Users reported feeling more ownership and control when the AI scaffolded their thinking rather than generating content directly. The question-based mode performed best for divergent thinking tasks.
LLM interfaces for creative work default to model-led modes where the AI generates content proactively. Unclear whether human-led modes—where the AI asks questions or offers suggestions instead—produce better creative outcomes.
Method: Randomized controlled experiment with 486 participants compared three collaboration modes: human-led with AI suggestions, human-led with AI questions, and model-led with AI generating content. Measured creative output quality, user satisfaction, and perceived collaboration effectiveness across all three conditions.
Caveats: Single-session creative writing task. Long-term effects on creative skill development or professional workflows unknown.
Reflections: Do human-led modes remain superior across different creative domains (visual design, music composition, code architecture)? · How does the quality gap between human-led and model-led modes change as users gain experience with AI tools? · Can hybrid modes that switch between human-led and model-led based on task phase outperform either pure approach?