Follow
Filip Pavetic
Filip Pavetic
Verified email at google.com
Title
Cited by
Cited by
Year
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ...
arXiv preprint arXiv:2312.11805, 2023
25622023
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...
arXiv preprint arXiv:2403.05530, 2024
10312024
Scaling vision transformers to 22 billion parameters
M Dehghani, J Djolonga, B Mustafa, P Padlewski, J Heek, J Gilmer, ...
International Conference on Machine Learning, 7480-7512, 2023
5482023
Pali-x: On scaling up a multilingual vision and language model
X Chen, J Djolonga, P Padlewski, B Mustafa, S Changpinyo, J Wu, ...
arXiv preprint arXiv:2305.18565, 2023
1632023
Object scene representation transformer
MSM Sajjadi, D Duckworth, A Mahendran, S Van Steenkiste, F Pavetic, ...
Advances in neural information processing systems 35, 9512-9524, 2022
1092022
Flexivit: One model for all patch sizes
L Beyer, P Izmailov, A Kolesnikov, M Caron, S Kornblith, X Zhai, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
992023
Pali-3 vision language models: Smaller, faster, stronger
X Chen, X Wang, L Beyer, A Kolesnikov, J Wu, P Voigtlaender, B Mustafa, ...
arXiv preprint arXiv:2310.09199, 2023
822023
The auto arborist dataset: a large-scale benchmark for multiview urban forest monitoring under domain shift
S Beery, G Wu, T Edwards, F Pavetic, B Majewski, S Mukherjee, S Chan, ...
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
432022
$ LCSk $++: Practical similarity metric for long strings
F Pavetić, G Žužić, M Šikić
arXiv preprint arXiv:1407.2407, 2014
102014
A study of autoregressive decoders for multi-tasking in computer vision
L Beyer, B Wan, G Madan, F Pavetic, A Steiner, A Kolesnikov, AS Pinto, ...
arXiv preprint arXiv:2303.17376, 2023
92023
On Scaling Up a Multilingual Vision and Language Model
X Chen, J Djolonga, P Padlewski, B Mustafa, S Changpinyo, J Wu, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
62024
Multi-step sequence alignment
PG Anders, F Pavetic
US Patent 9,959,448, 2018
62018
Methods, systems, and media for detecting abusive stereoscopic videos by generating fingerprints for multiple portions of a video frame
V Zamaraiev, F Pavetic
US Patent 9,872,056, 2018
62018
Fast and simple algorithms for computing both and
F Pavetić, I Katanić, G Matula, G Žužić, M Šikić
arXiv preprint arXiv:1705.07279, 2017
62017
Locca: Visual pretraining with location-aware captioners
B Wan, M Tschannen, Y Xian, F Pavetic, IM Alabdulmohsin, X Wang, ...
Advances in Neural Information Processing Systems 37, 116355-116387, 2025
42025
Detecting multiple parts of a screen to fingerprint to detect abusive uploading videos
F Pavetic, MR Konrad, H Pasula
US Patent 10,614,539, 2020
42020
Detecting multiple parts of a screen to fingerprint to detect abusive uploading videos
F Pavetic, MR Konrad, H Pasula
US Patent 9,972,060, 2018
32018
Scalable and Cost-Efficient Information Retrieval Architecture for Massive Datasets
F Pavetic, D Simcha, AT Voicu, F Chern, PW Sun, R Guo, HM Pasula, ...
US Patent App. 17/886,860, 2024
12024
Randomly generated blobs to improve object-detection training for framed video content
M Kandpal, B Ashirmatov, F Pavetic
US Patent 11,734,908, 2023
12023
Methods, systems, and media for detecting abusive stereoscopic videos by generating fingerprints for multiple portions of a video frame
V Zamaraiev, F Pavetic
US Patent 10,499,097, 2019
12019
The system can't perform the operation now. Try again later.
Articles 1–20