Follow
Tony (Tong) Geng
Tony (Tong) Geng
Other namesTong Geng
Assistant Professor, University of Rochester
Verified email at rochester.edu - Homepage
Title
Cited by
Cited by
Year
AWB-GCN: A graph convolutional network accelerator with runtime workload rebalancing
T Geng, A Li, R Shi, C Wu, T Wang, Y Li, P Haghi, A Tumeo, S Che, ...
2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020
3072020
I-GCN: A graph convolutional network accelerator with runtime locality enhancement through islandization
T Geng, C Wu, Y Zhang, C Tan, C Xie, H You, M Herbordt, Y Lin, A Li
MICRO-54: 54th annual IEEE/ACM international symposium on microarchitecture …, 2021
1192021
FPDeep: Acceleration and load balancing of CNN training on FPGA clusters
T Geng, T Wang, A Sanaullah, C Yang, R Xu, R Patel, M Herbordt
2018 IEEE 26th Annual International Symposium on Field-Programmable Custom …, 2018
1062018
Accelerating Transformer-based Deep Learning Models on FPGAs using Column Balanced Block Pruning
H Peng, S Huang, T Geng, A Li, W Jiang, H Liu, S Wang, C Ding
The 22nd International Symposium on Quality Electronic Design, 1-8, 2021
1042021
A framework for acceleration of CNN training on deeply-pipelined FPGA clusters with work and weight load balancing
T Geng, T Wang, A Sanaullah, C Yang, R Patel, M Herbordt
2018 28th international conference on field programmable logic and …, 2018
742018
FPDeep: Scalable Acceleration of CNN Training on Deeply-Pipelined FPGA Clusters
T Geng, T Wang, A Li, X Jin, M Herbordt
IEEE Transactions on Computers, 2020
64*2020
Fully integrated FPGA molecular dynamics simulations
C Yang, T Geng, T Wang, R Patel, Q Xiong, A Sanaullah, C Wu, J Sheng, ...
Proceedings of the International Conference for High Performance Computing …, 2019
612019
Gcod: Graph convolutional network acceleration via dedicated algorithm and accelerator co-design
H You, T Geng, Y Zhang, A Li, Y Lin
2022 IEEE International Symposium on High-Performance Computer Architecture …, 2022
572022
Tripartite feature enhanced pyramid network for dense prediction
D Liu, J Liang, T Geng, A Loui, T Zhou
IEEE Transactions on Image Processing 32, 2678-2692, 2023
532023
A length adaptive algorithm-hardware co-design of transformer on fpga through sparse attention and dynamic pipelining
H Peng, S Huang, S Chen, B Li, T Geng, A Li, W Jiang, W Wen, J Bi, H Liu, ...
Proceedings of the 59th ACM/IEEE Design Automation Conference, 1135-1140, 2022
532022
LP-BNN: Ultra-low-latency BNN inference with layer parallelism
T Geng, T Wang, C Wu, C Yang, SL Song, A Li, M Herbordt
2019 IEEE 30th International Conference on Application-specific Systems …, 2019
482019
Clusterfomer: clustering as a universal visual learner
J Liang, Y Cui, Q Wang, T Geng, W Wang, D Liu
Advances in neural information processing systems 36, 2024
472024
Dissecting tensor cores via microbenchmarks: Latency, throughput and numeric behaviors
W Sun, A Li, T Geng, S Stuijk, H Corporaal
IEEE Transactions on Parallel and Distributed Systems 34 (1), 246-261, 2022
472022
BSTC: A novel binarized-soft-tensor-core design for accelerating bit-based approximated neural nets
A Li, T Geng, T Wang, M Herbordt, SL Song, K Barker
Proceedings of the international conference for high performance computing …, 2019
462019
O3BNN-R: An out-of-order architecture for high-performance and regularized BNN inference
T Geng, A Li, T Wang, C Wu, Y Li, R Shi, W Wu, M Herbordt
IEEE Transactions on Parallel and Distributed Systems 32 (1), 199-213, 2021
412021
Ghostsz: A transparent fpga-accelerated lossy compression framework
Q Xiong, R Patel, C Yang, T Geng, A Skjellum, MC Herbordt
2019 IEEE 27th Annual International Symposium on Field-Programmable Custom …, 2019
412019
Apnn-tc: Accelerating arbitrary precision neural networks on ampere gpu tensor cores
B Feng, Y Wang, T Geng, A Li, Y Ding
Proceedings of the international conference for high performance computing …, 2021
382021
Accel-gcn: High-performance gpu accelerator design for graph convolution networks
X Xie, H Peng, A Hasan, S Huang, J Zhao, H Fang, W Zhang, T Geng, ...
2023 IEEE/ACM International Conference on Computer Aided Design (ICCAD), 01-09, 2023
342023
Autorep: Automatic relu replacement for fast private network inference
H Peng, S Huang, T Zhou, Y Luo, C Wang, Z Wang, J Zhao, X Xie, A Li, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
342023
G-CoS: GNN-accelerator co-search towards both better accuracy and efficiency
Y Zhang, H You, Y Fu, T Geng, A Li, Y Lin
2021 IEEE/ACM International Conference On Computer Aided Design (ICCAD), 1-9, 2021
342021
The system can't perform the operation now. Try again later.
Articles 1–20