| Title | Authors | Venue | Cited by | Year |
| --- | --- | --- | --- | --- |
| Convolutional 2D Knowledge Graph Embeddings | T Dettmers, P Minervini, P Stenetorp, S Riedel | AAAI 2018 | 3079 | 2018 |
| QLoRA: Efficient Finetuning of Quantized LLMs | T Dettmers, A Pagnoni, A Holtzman, L Zettlemoyer | NeurIPS 2023 (Oral) | 2000 | 2023 |
| BLOOM: A 176B-Parameter Open-Access Multilingual Language Model | T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ... | | 1626 | 2023 |
| LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale | T Dettmers, M Lewis, Y Belkada, L Zettlemoyer | NeurIPS 2022 | 853* | 2022 |
| Sparse Networks from Scratch: Faster Training without Losing Performance | T Dettmers, L Zettlemoyer | arXiv preprint arXiv:1907.04840 | 379 | 2019 |
| BASE Layers: Simplifying Training of Large, Sparse Models | M Lewis, S Bhosale, T Dettmers, N Goyal, L Zettlemoyer | ICML 2021 | 234 | 2021 |
| 8-bit Approximations for Parallelism in Deep Learning | T Dettmers | ICLR 2016 | 228 | 2016 |
| 8-bit Optimizers via Block-wise Quantization | T Dettmers, M Lewis, S Shleifer, L Zettlemoyer | ICLR 2022 (Spotlight) | 218 | 2022 |
| The Case for 4-bit Precision: k-bit Inference Scaling Laws | T Dettmers, L Zettlemoyer | ICML 2023 | 170 | 2023 |
| SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression | T Dettmers, R Svirschevski, V Egiazarian, D Kuznedelev, E Frantar, ... | arXiv preprint arXiv:2306.03078 | 162 | 2023 |
| Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models | M Li, S Gururangan, T Dettmers, M Lewis, T Althoff, NA Smith, ... | arXiv preprint arXiv:2208.03306 | 123 | 2022 |
| Petals: Collaborative Inference and Fine-tuning of Large Models | A Borzunov, D Baranchuk, T Dettmers, M Ryabinin, Y Belkada, ... | ACL 2022 (Demonstration) | 82* | 2022 |
| Stable and Low-Precision Training for Large-Scale Vision-Language Models | M Wortsman, T Dettmers, L Zettlemoyer, A Morcos, A Farhadi, L Schmidt | NeurIPS 2023 | 29 | 2023 |
| SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient | M Ryabinin, T Dettmers, M Diskin, A Borzunov | NeurIPS 2023 | 20 | 2023 |
| Jack the Reader - A Machine Reading Framework | D Weissenborn, P Minervini, T Dettmers, I Augenstein, J Welbl, ... | arXiv preprint arXiv:1806.08727 | 12 | 2018 |
| Training Transformers Together | A Borzunov, M Ryabinin, T Dettmers, Q Lhoest, L Saulnier, M Diskin, ... | NeurIPS 2021 (Demonstration) | 9 | 2022 |
| High Performance Natural Language Processing | G Ilharco, C Ilharco, I Turc, T Dettmers, F Ferreira, K Lee | EMNLP 2020 (Tutorial) | 7 | 2020 |
| MatFormer: Nested Transformer for Elastic Inference | S Kudugunta, A Kusupati, T Dettmers, K Chen, I Dhillon, Y Tsvetkov, ... | arXiv preprint arXiv:2310.07707 | 6 | 2023 |
| Towards a Unified View of Sparse Feed-Forward Network in Pretraining Large Language Model | LZ Liu, T Dettmers, XV Lin, V Stoyanov, X Li | EMNLP 2023 | 3 | 2023 |
| OLMoE: Open Mixture-of-Experts Language Models | N Muennighoff, L Soldaini, D Groeneveld, K Lo, J Morrison, S Min, W Shi, ... | arXiv preprint arXiv:2409.02060 | 2 | 2024 |