Vasily Volkov
Vasily Volkov
Verified email at
Cited by
Cited by
Benchmarking GPUs to tune dense linear algebra
V Volkov, JW Demmel
SC'08: Proceedings of the 2008 ACM/IEEE conference on Supercomputing, 1-11, 2008
Stencil computation optimization and auto-tuning on state-of-the-art multicore architectures
K Datta, M Murphy, V Volkov, S Williams, J Carter, L Oliker, D Patterson, ...
SC'08: Proceedings of the 2008 ACM/IEEE conference on Supercomputing, 1-12, 2008
Parallel computing experiences with CUDA
M Garland, S Le Grand, J Nickolls, J Anderson, J Hardwick, S Morton, ...
Micro, IEEE 28 (4), 13-27, 2008
Better performance at lower occupancy
V Volkov
Proceedings of the GPU technology conference, GTC 10, 16, 2010
LU, QR and Cholesky factorizations using vector capabilities of GPUs
V Volkov, J Demmel
Technical Report UCB/EECS-2008-49, EECS Department, University of California …, 2008
Fitting FFT onto the G80 architecture
V Volkov, B Kazian
University of California, Berkeley, 2008
Auto-tuning the 27-point stencil for multicore
K Datta, S Williams, V Volkov, J Carter, L Oliker, J Shalf, K Yelick
In Proc. iWAPT2009: The Fourth International Workshop on Automatic …, 2009
Understanding latency hiding on GPUs
V Volkov
UC Berkeley, 2016
Building an efficient hash table on the GPU
DA Alcantara, V Volkov, S Sengupta, M Mitzenmacher, JD Owens, ...
GPU Computing Gems Jade Edition, 39-53, 2012
Cloth animation with adaptively refined meshes
L Li, V Volkov
Proceedings of the Twenty-eighth Australasian conference on Computer Science …, 2005
Using GPUs to accelerate the bisection algorithm for finding eigenvalues of symmetric tridiagonal matrices
V Volkov, JW Demmel
Department of Computer Science, University of Tennessee, Knoxville, inst-UT …, 2008
Real-time refinement and simplification of adaptive triangular meshes
V Volkov, L Li
IEEE Visualization, 2003. VIS 2003., 155-162, 2003
Use registers and multiple outputs per thread on GPU
V Volkov
International Workshop on Parallel Matrix Algorithms and Applications 2010 …, 2010
Adaptive local refinement and simplification of cloth meshes
V Volkov, L Li
First International Conference on Information Technology & Applications …, 2002
Auto-Tuning Stencil Computations on Multicore and Accelerators.
K Datta, S Williams, V Volkov, J Carter, L Oliker, J Shalf, KA Yelick
Scientific Computing with Multicore and Accelerators, 219-253, 2010
Programming inverse memory hierarchy: case of stencils on GPUs
V Volkov
GPU Workshop for Scientific Computing, International Conference on Parallel …, 2010
Inflatable models
L Li, V Volkov
Journal of Computer Science and Technology 21 (2), 154-158, 2006
Unrolling parallel loops
V Volkov
Tutorial at the, 133, 2011
A microbenchmark to study GPU performance models
V Volkov
ACM SIGPLAN Notices 53 (1), 421-422, 2018
Using GPUs to accelerate linear algebra routines
V Volkov, J Demmel
URL http://www. cs. berkeley. edu/~ volkov/volkov08-parlab. pdf, 2008
The system can't perform the operation now. Try again later.
Articles 1–20