Skip to content

Ft_v1 QAIC-profiler hotfix#994

Open
quic-akuruvil wants to merge 9 commits into
quic:ft_experimental_v1from
smedhe:ft_v1_profiler_hotfix
Open

Ft_v1 QAIC-profiler hotfix#994
quic-akuruvil wants to merge 9 commits into
quic:ft_experimental_v1from
smedhe:ft_v1_profiler_hotfix

Conversation

@quic-akuruvil
Copy link
Copy Markdown
Contributor

@quic-akuruvil quic-akuruvil commented May 18, 2026

Moves torch_dtype from training config to model config.

Adds explicit training.fp16 and training.bf16 flags with validation.

Hardens QAIC profiler and op-by-op verifier callback handling.

Updates configs, docs, and tests to match the new precision schema.

Signed-off-by: Sharvari Medhe <smedhe@qti.qualcomm.com>
@smedhe smedhe force-pushed the ft_v1_profiler_hotfix branch from d887cfd to aa24bb9 Compare May 18, 2026 09:40
Copy link
Copy Markdown
Contributor Author

@quic-akuruvil quic-akuruvil left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

MAke sure to run all local unit tests. ALso the existing distributed techniques on smaller samples.

smedhe added 5 commits May 18, 2026 17:22
Signed-off-by: Sharvari Medhe <smedhe@qti.qualcomm.com>
…6 flags

Signed-off-by: Sharvari Medhe <smedhe@qti.qualcomm.com>
…ling

Signed-off-by: Sharvari Medhe <smedhe@qti.qualcomm.com>
Signed-off-by: Sharvari Medhe <smedhe@qti.qualcomm.com>
Signed-off-by: Sharvari Medhe <smedhe@qti.qualcomm.com>

monkeypatch.setattr(callbacks_module, "init_qaic_profiling", _mock_start)

callback = QAICProfilerCallback(start_step=3, end_step=9, trace_dir="/tmp/hw-trace", device_ids=[2])
Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why is /tmp dir used here. It is best to avoid /tmp dir from the root.

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Using the project root now and added cleanup code too

lambda use_profiler, device_type, trace_dir=None: calls.append((use_profiler, device_type, trace_dir)),
)

callback = QAICProfilerCallback(start_step=0, trace_dir="/tmp/hw-trace")
Copy link
Copy Markdown
Contributor Author

@quic-akuruvil quic-akuruvil May 18, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same as above. the tmp folder from root dir fills fast. So move to some home folder, even for temporary results. I think by default these logs will be dumped into in qaic-dumps directory (in current Qeff folder). Can we not maintain same path?

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Using the project root now and added cleanup code too

Comment thread docs/source/config.md
- Use `qaic_op_by_op_verifier_callback` with `training.fp16: false` and `model.torch_dtype: fp32`, for only `1-3` steps.

**References to some commonly used Hugging Face callbacks**:
https://huggingface.co/docs/transformers/en/main_classes/callback
Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Update the docs for model dtype arg also.

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

added the documentation for torch_dtype, fp16 and bf16 params

@quic-akuruvil
Copy link
Copy Markdown
Contributor Author

Summarize the changes in the description of pR

smedhe added 3 commits May 19, 2026 15:25
Signed-off-by: Sharvari Medhe <smedhe@qti.qualcomm.com>
Signed-off-by: Sharvari Medhe <smedhe@qti.qualcomm.com>
Signed-off-by: Sharvari Medhe <smedhe@qti.qualcomm.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants