Start with the right input
There are two common ways in. Text-to-3D works well when the idea is still loose and you want to explore shape with language. Image-to-3D works better when you already have a photo, sketch, or concept image that carries the silhouette.
That choice matters because it changes what you expect from the first result. Text is good for exploration. Images are better for guidance.