[IR2Vec] Add embeddings mode to llvm-ir2vec tool by svkeerthy · Pull Request #147844 · llvm/llvm-project

svkeerthy · 2025-07-09T22:44:32Z

Add embedding generation functionality to the llvm-ir2vec tool, complementing the existing triplet generation mode.

This change completes the IR2Vec tool by adding the embedding generation functionality, which was previously mentioned as a TODO item. The tool now supports both triplet generation for vocabulary training and embedding generation using a trained vocabulary.

svkeerthy · 2025-07-09T22:44:59Z

This stack of pull requests is managed by Graphite. Learn more about stacking.

boomanaiden154

Premerge failures here also look relevant.

llvm/test/tools/llvm-ir2vec/embeddings.ll

llvm/tools/llvm-ir2vec/llvm-ir2vec.cpp

github-actions · 2025-07-11T21:37:35Z

✅ With the latest revision this PR passed the Python code formatter.

boomanaiden154

Minor style nits, otherwise LGTM.

boomanaiden154 · 2025-07-14T15:44:07Z

llvm/tools/llvm-ir2vec/llvm-ir2vec.cpp


 using namespace llvm;
-using namespace ir2vec;
+using namespace llvm::ir2vec;


Instead of using statements, it might be better to wrap everything outside of main in an anonymous namespace inside the llvm::ir2vec namespace. I'm not sure what the coding standards are, but that's the pattern I see in other tools like llvm-exegesis.

boomanaiden154 · 2025-07-14T15:45:21Z

llvm/tools/llvm-ir2vec/llvm-ir2vec.cpp

+
+    // Generate embeddings based on the specified level
+    switch (Level) {
+    case FunctionLevel: {


Does clang-format not let you indent here?

Wierdly yes!

kazutakahirata

LGTM. Thanks!

svkeerthy · 2025-07-17T18:58:58Z

Merge activity

Jul 17, 6:58 PM UTC: A user started a stack merge that includes this pull request via Graphite.
Jul 17, 7:04 PM UTC: Graphite rebased this pull request as part of a merge.
Jul 17, 7:06 PM UTC: @svkeerthy merged this pull request with Graphite.

) Add a new LLVM tool `llvm-ir2vec`. This tool is primarily intended to generate triplets for training the vocabulary (#141834) and to potentially generate the embeddings in a stand alone manner. This PR introduces the tool with triplet generation functionality. In the upcoming PRs I'll add scripts under `utils/mlgo` to complete the vocabulary tooling. #147844 adds embedding generation logic to the tool. (Tracking issue - #141817)

svkeerthy changed the title ~~IR2Vec Tool Enhancements~~ [IR2Vec] Add embeddings mode to llvm-ir2vec tool Jul 9, 2025

svkeerthy marked this pull request as ready for review July 9, 2025 22:55

svkeerthy requested review from boomanaiden154, kazutakahirata, mtrofin and snehasish July 9, 2025 22:57

boomanaiden154 reviewed Jul 10, 2025

View reviewed changes

svkeerthy force-pushed the users/svkeerthy/07-09-ir2vec_tool branch from 7c4d86d to 5f1f3fe Compare July 11, 2025 19:54

svkeerthy force-pushed the users/svkeerthy/07-09-ir2vec_tool_enhancements branch 2 times, most recently from bf757c0 to 684d298 Compare July 11, 2025 21:35

svkeerthy force-pushed the users/svkeerthy/07-09-ir2vec_tool_enhancements branch 2 times, most recently from 2d88b38 to 6fd2dca Compare July 11, 2025 22:10

svkeerthy requested a review from boomanaiden154 July 11, 2025 22:10

boomanaiden154 approved these changes Jul 14, 2025

View reviewed changes

svkeerthy force-pushed the users/svkeerthy/07-09-ir2vec_tool_enhancements branch from 6fd2dca to 4e92c2b Compare July 14, 2025 17:40

svkeerthy force-pushed the users/svkeerthy/07-09-ir2vec_tool branch from 5f1f3fe to 744b38b Compare July 14, 2025 17:40

svkeerthy force-pushed the users/svkeerthy/07-09-ir2vec_tool_enhancements branch from 4e92c2b to f975249 Compare July 14, 2025 18:02

svkeerthy force-pushed the users/svkeerthy/07-09-ir2vec_tool branch from 744b38b to 51b0120 Compare July 14, 2025 18:11

svkeerthy force-pushed the users/svkeerthy/07-09-ir2vec_tool_enhancements branch 2 times, most recently from ab12375 to a3b518b Compare July 14, 2025 20:45

svkeerthy force-pushed the users/svkeerthy/07-09-ir2vec_tool branch from 51b0120 to e931cf1 Compare July 14, 2025 20:45

svkeerthy mentioned this pull request Jul 14, 2025

[IR2Vec] Adding documentation for llvm-ir2vec tool #148719

Merged

svkeerthy force-pushed the users/svkeerthy/07-09-ir2vec_tool branch from e931cf1 to 0f1720f Compare July 14, 2025 23:40

svkeerthy force-pushed the users/svkeerthy/07-09-ir2vec_tool_enhancements branch 2 times, most recently from f2498dc to 7b801df Compare July 16, 2025 22:49

svkeerthy force-pushed the users/svkeerthy/07-09-ir2vec_tool branch from 0f1720f to 52ec5db Compare July 16, 2025 22:49

svkeerthy force-pushed the users/svkeerthy/07-09-ir2vec_tool branch from 52ec5db to 36fe251 Compare July 16, 2025 23:32

svkeerthy force-pushed the users/svkeerthy/07-09-ir2vec_tool_enhancements branch from 7b801df to df6bdef Compare July 16, 2025 23:32

svkeerthy force-pushed the users/svkeerthy/07-09-ir2vec_tool branch from 36fe251 to 47d402c Compare July 16, 2025 23:46

svkeerthy force-pushed the users/svkeerthy/07-09-ir2vec_tool_enhancements branch from df6bdef to 0ee74a8 Compare July 16, 2025 23:46

svkeerthy force-pushed the users/svkeerthy/07-09-ir2vec_tool branch from f4181fd to 7f45a74 Compare July 17, 2025 18:04

svkeerthy force-pushed the users/svkeerthy/07-09-ir2vec_tool_enhancements branch from 0ee74a8 to c0360c7 Compare July 17, 2025 18:04

kazutakahirata approved these changes Jul 17, 2025

View reviewed changes

svkeerthy force-pushed the users/svkeerthy/07-09-ir2vec_tool branch from 7f45a74 to 74e3b78 Compare July 17, 2025 19:00

Base automatically changed from users/svkeerthy/07-09-ir2vec_tool to main July 17, 2025 19:03

IR2Vec Tool Enhancements

537495c

svkeerthy force-pushed the users/svkeerthy/07-09-ir2vec_tool_enhancements branch from c0360c7 to 537495c Compare July 17, 2025 19:04

svkeerthy merged commit 70e2319 into main Jul 17, 2025
6 checks passed

svkeerthy deleted the users/svkeerthy/07-09-ir2vec_tool_enhancements branch July 17, 2025 19:06

This was referenced Jul 23, 2025

test abhinavgaba/llvm-project#2

Closed

Add dataFence plugin interface abhinavgaba/llvm-project#3

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[IR2Vec] Add embeddings mode to llvm-ir2vec tool#147844

[IR2Vec] Add embeddings mode to llvm-ir2vec tool#147844
svkeerthy merged 1 commit intomainfrom
users/svkeerthy/07-09-ir2vec_tool_enhancements

svkeerthy commented Jul 9, 2025 •

edited

Loading

Uh oh!

svkeerthy commented Jul 9, 2025 •

edited

Loading

Uh oh!

boomanaiden154 left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Jul 11, 2025 •

edited

Loading

Uh oh!

boomanaiden154 left a comment

Uh oh!

boomanaiden154 Jul 14, 2025

Uh oh!

svkeerthy Jul 14, 2025

Uh oh!

boomanaiden154 Jul 14, 2025

Uh oh!

svkeerthy Jul 14, 2025

Uh oh!

kazutakahirata left a comment

Uh oh!

svkeerthy commented Jul 17, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

svkeerthy commented Jul 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

svkeerthy commented Jul 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

boomanaiden154 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Jul 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

boomanaiden154 left a comment

Choose a reason for hiding this comment

Uh oh!

boomanaiden154 Jul 14, 2025

Choose a reason for hiding this comment

Uh oh!

svkeerthy Jul 14, 2025

Choose a reason for hiding this comment

Uh oh!

boomanaiden154 Jul 14, 2025

Choose a reason for hiding this comment

Uh oh!

svkeerthy Jul 14, 2025

Choose a reason for hiding this comment

Uh oh!

kazutakahirata left a comment

Choose a reason for hiding this comment

Uh oh!

svkeerthy commented Jul 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Merge activity

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

svkeerthy commented Jul 9, 2025 •

edited

Loading

svkeerthy commented Jul 9, 2025 •

edited

Loading

github-actions bot commented Jul 11, 2025 •

edited

Loading

svkeerthy commented Jul 17, 2025 •

edited

Loading