Jump to main content Jump to sidebar

Forums
Wiki

Log in
Sign up

/f/MachineLearning

[P] Up to 12X faster GPU inference on Bert, T5 and other transformers with OpenAI Triton kernels

Submitted by pommedeterresautee t3_ydqmjp on October 26, 2022 at 6:10 AM in MachineLearning

40 comments

352

Viewing a single comment thread. View all comments

[deleted] t1_itxsyu3 wrote on October 27, 2022 at 2:11 AM

[deleted]

Permalink

1

0 points (+0, −0)

Short URL:

http://metis.lti.cs.cmu.edu:9999/14054

MachineLearning

t5_2r3gv

Created October 1, 2022
Subscribe via RSS

Toolbox

Bans
Moderation log

Running Postmill