I-FlexGen yinjini yokusebenzisa i-ChatGPT-efana ne-AI bots kwiinkqubo ze-GPU enye

Iqela labaphandi abavela kwiYunivesithi yaseStanford, iYunivesithi yaseCalifornia eBerkeley, i-ETH Zurich, i-Graduate School of Economics, iYunivesithi yaseCarnegie Mellon, kunye neYandex kunye neMeta, bapapashe ikhowudi yomthombo we-injini yokuqhuba iimodeli zolwimi ezinkulu kwimithombo. -inkqubo ezithintelweyo. Ngokomzekelo, i-injini inika amandla okudala ukusebenza okukhumbuza i-ChatGPT kunye ne-Copilot ngokuqhuba imodeli ye-OPT-175B eqeqeshwe kwangaphambili, egubungela i-175 yeebhiliyoni zeeparamitha, kwikhompyutheni eqhelekileyo kunye nekhadi lemizobo ye-NVIDIA RTX3090 yokudlala exhotywe nge-24GB yememori yevidiyo. Ikhowudi ibhalwe kwiPython, isebenzisa isakhelo sePyTorch kwaye ihanjiswa phantsi kwelayisensi ye-Apache 2.0.

Ibandakanya iscript somzekelo wokudala i-bots ekuvumela ukuba ukhuphele enye yeemodeli zolwimi ezikhoyo kwaye kwangoko uqalise ukunxibelelana (umzekelo, ngokusebenzisa umyalelo "python apps/chatbot.py -model facebook/opt-30b β€” -percent 0 100 100 0 100 0”). Njengesiseko, kucetywa ukuba kusetyenziswe imodeli yolwimi olukhulu olupapashwe nguFacebook, oqeqeshwe kwiingqokelela zeBookCorpus (10 amawaka eencwadi), CC-Stories, Pile (OpenSubtitles, Wikipedia, DM Mathematics, HackerNews, njl.), Pushshift. io (ngokusekwe kwidatha yeReddit ) kunye neCCNewsV2 (uvimba weendaba). Imodeli ihlanganisa malunga ne-180 yeebhiliyoni zamathokheni (i-800 GB yedatha). Iintsuku ezingama-33 zokusebenza kweqela kunye ne-992 NVIDIA A100 80GB GPUs zachithwa ekuqeqesheni imodeli.

Xa uqhuba imodeli ye-OPT-175B kwinkqubo ene-NVIDIA T4 GPU (16GB) enye, injini ye-FlexGen ibonise ukusebenza ukuya kumaxesha angama-100 ngokukhawuleza kunezisombululo ezibonelelwe ngaphambili, okwenza ukusetyenziswa kweemodeli zeelwimi ezinkulu zifikeleleke kwaye zivumele ukuba ziqhube phambili. iinkqubo ezingenayo i-accelerators ezizinikeleyo. Kwangaxeshanye, iFlexGen inokulinganisa ukulinganisa izibalo ngeeGPU ezininzi. Ukunciphisa ubungakanani bemodeli, iskimu sokunyanzeliswa kweparamitha yeproprietary kunye nemodeli ye-caching mechanism iyasetyenziswa.

Okwangoku, i-FlexGen ixhasa kuphela iimodeli zeelwimi ze-OPT, kodwa kwixesha elizayo abaphuhlisi bathembisa ukongeza inkxaso ye-BLOOM (i-176 yeebhiliyoni zeeparamitha, ixhasa iilwimi ezingama-46 kunye neelwimi ezili-13 zeprogram), i-CodeGen (inokuvelisa ikhowudi kwiilwimi ezingama-22) kunye Iimodeli ze-GLM Umzekelo wengxoxo kunye nebhoti esekwe kwiFlexGen kunye nemodeli ye-OPT-30B:

Buntu: Yintoni igama leyona ntaba iphakamileyo ehlabathini?

Umncedisi: Everest.

Buntu: Ndilungiselela uhambo lwesikhumbuzo sethu. Ziziphi izinto esinokuzenza?

Umncedisi: Ewe, zininzi izinto onokuzenza kwisikhumbuzo sakho. Okokuqala, ungadlala amakhadi. Okwesibini, ungahamba uhambo. Okwesithathu, ungaya kwimyuziyam.

umthombo: opennet.ru

Yongeza izimvo