CCT Attention Training on Siracusa #69
Merged
Commits
112 commits
9ec13f9
Add classifier training support
runwangdl f1a0491
Fix L3 DMA and Maxpool Bugs
runwangdl 29baf2c
WIP Static Memory Allocation of IOs
Victor-Jung 25be229
Temporary fix broken float softmax
Victor-Jung da56cbe
Fix lifetime of aliased input buffers
Victor-Jung 721f747
Fix output buffer lifetime
Victor-Jung 78685e5
Linting
Victor-Jung 02b5435
WIP fix output buffer lifetime
Victor-Jung a2d67a0
Change RQHardswish dim due to compiler bug
Victor-Jung bdd92de
Fix typo
Victor-Jung 20b1f8b
Fix duplicated IO in memory allocation visualization
Victor-Jung c708069
Fix the Constant Tensor offset to not take into account IO since they…
Victor-Jung b6e2448
Add new attribute to Variable and Transient buffer to annotate if the…
Victor-Jung 7e96f18
Adapt calculateLifetime to use buffer I/O annotation
Victor-Jung b923520
Fix typo
Victor-Jung f4cb9e0
Remove IO buffer name and refactor var name
Victor-Jung 435cc9d
Linting
Victor-Jung 731f39f
Test the correctness of the memory map after memory allocation
Victor-Jung dd1370c
Allocate memory arena first
Victor-Jung 8bfdb13
correct DMA lengh of copy assertion
runwangdl f01eb7f
Align memory allocation test
Victor-Jung 031dc79
delete redundant shell scripts
runwangdl 58e18da
Merge branch 'devel' into PULPCCTL3_16_16_64
runwangdl ac2d879
Update node with multioutput to single output
runwangdl 6a7198b
add softmaxcrossentropygrad tiling
runwangdl 360aef7
Add softmaxcrossentropylossgrad tiling
runwangdl bc48582
Merge branch 'PULPCCTL3_16_16_64' into GEMM_training_tiled
runwangdl b6542ba
Fix CI issue
runwangdl fe208d0
Fix CI bugs
runwangdl 4a21359
update CI
runwangdl a0dcb6d
Improve memory alloc visualization
Victor-Jung 91f12f0
Add and pass test for CCT gemmtraining 1_16_16_8 to 128
runwangdl d1e1ebf
update CI with 8-128 dim CCT last gemm training test
runwangdl 86a2e99
Add SGD support for PULP Open
runwangdl bdacd2f
Update CCT training test with sgd
runwangdl b5421cc
Multi-level profiling + Linting
Victor-Jung 99035f0
Update Changelog
runwangdl 62e87d3
Merge branch 'devel' into GEMM_training_tiled
runwangdl 15ea3ec
Solved issues caused by merging conflicts
runwangdl a644fdf
Solved Review Comments
runwangdl 643e160
Resolving conflicts
runwangdl 80a9518
Reresolve the conflict
runwangdl 501775d
Solving CI issues
runwangdl 65a56b7
fix linting errors
runwangdl 03c3f4a
gelu sigmoid approximation
runwangdl 7e141fd
gelu parallel + unroll
runwangdl c3ee783
Float Matmul Parallel on M
runwangdl 47d8c19
Softmax Parallel and Softmax Op Support
runwangdl ccba380
conv parallel without im2col
runwangdl fafcedf
PULP Layernorm Parallel
runwangdl 147e68f
Fixed CI issues
runwangdl 6e07dc9
fixing linting
runwangdl 8b2f685
Merge branch 'devel' into devel_CCT_Optim
runwangdl 9c0b8f6
Enlarge CI floatconv tiling L1 size for 8 core and delete CCT 128 tes…
runwangdl 4c36de2
matmul 1*4 unrolling
runwangdl 28ec2ca
Add computeOp support for CCT necessary kernels
runwangdl bf1f8ae
Add openlibm expf
runwangdl deac9ce
add relu, mul, maxpool ops num
runwangdl 3b12187
Optimize parallel for multiple kernels
runwangdl 49da947
Merge branch 'devel' into devel_CCT_Optim
runwangdl 47961b9
Merge branch 'devel' into devel_CCT_Optim
runwangdl 8907532
Change ConvTileConstraint to only tile on outchannel
runwangdl 133f9ae
Fix error in gelu
runwangdl f25127d
Fix Linting Issues
runwangdl 6f3f585
Merge branch 'devel' into devel_CCT_Optim
runwangdl 4ffea9b
Change CI tests
runwangdl 81c3460
profilling string change to const static
runwangdl 4af69de
Fix profiling dual loop issue
runwangdl e819626
Add RV32IMF Picolibc support for Siracusa platform
runwangdl fa0cc37
Build Docker for new gvsoc for testing
runwangdl ac56ca2
Gvsoc Small test
runwangdl fd6c99d
Add Redmule Platform, Engline, Tiler, and Deployer
runwangdl 2862f29
Add rv32imf.txt to build docker
runwangdl 9ef9cc2
Update GVSOC hash
runwangdl 10de9f6
matmul delicate constraints for Redmule
runwangdl efab54c
Merge branch 'devel_CCT_Optim' into redmule_platform
runwangdl 37670e6
conv with redmule
runwangdl 08b7e23
Add CCT 32 test
runwangdl e42b3d6
xtensor gvsoc docker build
runwangdl 823d847
add softmaxgrad tileconstraint
runwangdl 212ff3c
LayernormGrad and CCT MLP Training Graph
runwangdl d7346a5
Merge branch 'devel' into exp/heterogeneous-memory-placement
runwangdl c51694b
Fix Layernormgrad
runwangdl 3efa661
Add Gelugrad
runwangdl aee7651
Merge branch 'exp/heterogeneous-memory-placement' into AttentionTraining
runwangdl b40cbd7
GEMM with Redmule
runwangdl 203f095
Efficient GEMM
runwangdl 7835c5a
reducesum tileconstraint
runwangdl 21294bb
temporary deactiate transposesplit otherwise kq training failed
runwangdl 90689e2
merge devel
runwangdl 5c3f287
gemm no bias + input in name issue for codegenerate
runwangdl 3271c3a
Parallelization and Optimization of CCT Inference and Training Kernel…
runwangdl 7f99f2c
Adapation for Merging Devel
runwangdl 2b46d2d
AttentionTraining Support
runwangdl bc3f951
clean unwanted deeplotest
runwangdl fe13842
Merge branch 'devel' into AttentionTraining
runwangdl d867f73
Remove Redmule Content from this branch
runwangdl a7d6903
Fix Bugs after merge
runwangdl 0806442
Update CCT training testcases
runwangdl df6e698
Add CCT2 Training to CI
runwangdl c28300a
Fix CI errors
runwangdl 06fa447
Remove redundant files
runwangdl 3b2af3f
Decrease unneccessary changes compared with devel
runwangdl caa751a
Fix transposesplit samenaming issue & update GEMM no bias for tiling
runwangdl 179262e
Fixing Linting
runwangdl 0e16453
Remove redundant changes
runwangdl 3f5a042
Fix linting again
runwangdl 5502d04
Merge latest devel including TinyViT tiling support
runwangdl a292765
[AttentionTraining] Fixed PR Comments
runwangdl 7bf6395
[AttentionTraining] Fix Linting
runwangdl 00d542f
[AttentionTraining] Remove three input add binding
runwangdl 63cd06a
[AttentionTraining] Remove formatting_report.log generated from local…
runwangdl