Is `argmax` the only sampling method for the LLaDA model? Why are the model outputs sometimes filled with '\n'? How can I mitigate this effect?
The issue was opened by redwyd and has received 1 comment.
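For context on the question, here is a minimal sketch of one common alternative to plain `argmax`: temperature sampling via the Gumbel-max trick. It is not taken from the LLaDA codebase. The function name `sample_token` and the parameters `temperature` and `banned_ids` are hypothetical, introduced only for illustration. `banned_ids` is a blunt, assumed workaround for suppressing a token such as the one for '\n', and is not a confirmed fix for the behavior described above.

```python
import math
import random


def sample_token(logits, temperature=0.0, rng=None, banned_ids=()):
    """Pick a token index from a list of raw logits (hypothetical helper).

    temperature == 0 reduces to plain argmax. temperature > 0 applies the
    Gumbel-max trick: argmax(logit / T + Gumbel noise) is distributed as
    softmax(logits / T).
    """
    rng = rng or random.Random()
    scores = []
    for i, logit in enumerate(logits):
        if i in banned_ids:
            # Assumed mitigation: never select these ids (e.g. a newline token).
            scores.append(-math.inf)
            continue
        if temperature > 0:
            # Clamp the uniform draw so that log(0) cannot occur.
            u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)
            gumbel = -math.log(-math.log(u))
            scores.append(logit / temperature + gumbel)
        else:
            scores.append(logit)
    # Index of the highest score; ties resolve to the lowest index.
    return max(range(len(scores)), key=scores.__getitem__)
```

With `temperature=0.0` the function always returns the highest-logit index, so repeated newline outputs would persist if '\n' dominates the logits. Raising the temperature or passing the newline token id in `banned_ids` changes which token is chosen, at the cost of less deterministic or more constrained output.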