RecNMP: Accelerating Personalized Recommendation with Near-Memory Processing
[Read More]
Centaur: A Chiplet-based, Hybrid Sparse-Dense Accelerator for Personalized Recommendations
Paper Review
Centaur: A Chiplet-based, Hybrid Sparse-Dense Accelerator for Personalized Recommendations
[Read More]
HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array
Paper Review
HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array
[Read More]
Distributed Deep Learning Using Synchronous Stochastic Gradient Descent
Paper Review
Distributed Deep Learning Using Synchronous Stochastic Gradient Descent
[Read More]
SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
Paper Review
SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
[Read More]