Hi, have you evaluated the model using only GPT3.5/Claude without HBR? This is important for the research community to compare against your work.