How to Evaluate the Effectiveness of Text-to-Game AI in Your Game