feat(dataset): add final smishing rewrites, documentation, and report #46
s223737886 wants to merge 3 commits into Hardhat-Enterprises:dev
Conversation
dec1belPP left a comment
Hey @s223737886, there are some changes required before we can review your PR:
- Your task's scope is to improve the dataset, so you should not be changing any existing JavaScript or Python files.
- Rename the old dataset or keep it as it is instead of deleting it.
- Resolve any conflicts manually before putting in your PR.
Please note that your PR will not be reviewed until all of these changes are made. Thank you.
|
I have made the required changes and committed to the repo again. I have also included the report from my pull request 47 in the latest commit, as that PR was closed. The report is named Smishing-report.
Hey @s223737886, this PR is still not at an acceptable standard for review. To reiterate, please:
Please note that your PR will not be reviewed until all of these changes are made. Thank you.
471e726 to 4ad0162
|
Thanks for the feedback, Pasindu. I've now cleaned the branch and made the following updates based on your instructions:
- Retained the original DatasetCombined.csv without any changes.
- Added a new file Dataset.csv with the rewritten smishing messages.
- Included only relevant changes related to the dataset: documentation (DatasetDocumentation) and the report (Smishing_Report.docx).
- Removed all unrelated JavaScript or Python file changes from the PR.
- Verified the branch is up-to-date with origin/dev.
Let me know if any other changes are needed. Thank you!
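A scope check like the one described in the list above could be sketched as follows. This is only an illustration, not something the PR contains: the function name `out_of_scope` and the rule set are hypothetical, and the allowed directories are simply the paths mentioned in this PR.

```python
from pathlib import PurePosixPath

# Assumption: only the directories named in this PR are in scope for the task.
ALLOWED_PREFIXES = (
    "machine-learning/datasets/",
    "machine-learning/projects/DatasetDocumentation/",
    "machine-learning/projects/Reports/",
)
# Assumption: JavaScript and Python files are out of scope, per the review comment.
BLOCKED_SUFFIXES = {".js", ".py"}


def out_of_scope(changed_paths):
    """Return the changed paths that fall outside the dataset task's scope."""
    flagged = []
    for path in changed_paths:
        suffix = PurePosixPath(path).suffix.lower()
        in_allowed_dir = path.startswith(ALLOWED_PREFIXES)
        if suffix in BLOCKED_SUFFIXES or not in_allowed_dir:
            flagged.append(path)
    return flagged
```

The list of changed paths could come from `git diff --name-only origin/dev...HEAD`, with one path per line; an empty result from `out_of_scope` would suggest the branch only touches dataset-related files.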
Summary
This pull request delivers the finalized dataset and supporting documentation for the Smishing Detection backend project, specifically for the Microsoft Planner task titled "Smishing Message Rewriting for Training and Smishing-report". It also contains a report named Smishing-report that explores how smishing attacks work and why they are effective.
What’s Included
✅ Final processed dataset located at:
machine-learning/datasets/Dataset.csv
source, intent_type, malicious, threat_level, linked_to, etc.
✅ Dataset documentation under:
machine-learning/projects/DatasetDocumentation
- dataset_schema.md
- rewriting_strategy.md
- smishing_taxonomy.md
- traceability_mapping.md
- preprocessing_guidelines.md
Report under:
machine-learning/projects/Reports/Smishing_Report.docx
Although the report differs from the work above, it delves into how smishing attacks work and why they are so effective.
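As a quick sanity check on the dataset, a reviewer could confirm that the names listed for Dataset.csv appear in its header row. The sketch below assumes those names are CSV column headers; the helper `missing_columns` and the sample row are hypothetical, and since the description ends with "etc.", the real file may contain further columns that are not known here.

```python
import csv
import io

# Names taken from the PR description; assumed to be CSV column headers.
EXPECTED_COLUMNS = ["source", "intent_type", "malicious", "threat_level", "linked_to"]


def missing_columns(csv_file, expected=EXPECTED_COLUMNS):
    """Return the expected column names that are absent from the CSV header row."""
    reader = csv.reader(csv_file)
    header = next(reader, [])  # an empty file yields an empty header
    present = {name.strip() for name in header}
    return [name for name in expected if name not in present]


# Hypothetical sample with one column left out, for illustration only.
sample = io.StringIO("source,intent_type,malicious,threat_level\nsms,phishing,1,high\n")
print(missing_columns(sample))  # ['linked_to']
```

For the real file, the same function can be called on `open("machine-learning/datasets/Dataset.csv", newline="", encoding="utf-8")`; the encoding is an assumption.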
Conventions Followed
- sms-rewriting/kalpna (named per contribution guideline format)
- feat, chore, etc.
- dev branch (not main)
- DatasetCombined.csv was removed as part of cleanup
Notes
Planner Task
This PR corresponds to the Microsoft Planner task: Smishing Message Rewriting for Training and Smishing-report
Dataset.csv
dataset_schema.md
preprocessing_guidelines.md
README.md
rewriting_strategy.md
smishing_taxonomy.md
traceability_mapping.md