Open
Labels
community-request, needs-follow-up, question
Description
Your question
Hi @Phlip79, this is regarding the issue I created earlier (see below) related to nvfp4. As a follow-up, I am trying to gauge the accuracy of the model after SFT training with nvfp4. Once the nvfp4 SFT training run completed, I tried serving the model with the SGLang inference serving engine. The output of the inference requests seems to be all gibberish. Everything seems to work fine without any quantization. Is there anything I am missing that corrupts the model during SFT post-training with nvfp4? Thanks.