FlexGen ke enjene ea ho tsamaisa bots ea AI ea ChatGPT ho sistimi e le 'ngoe ea GPU

Sehlopha sa bafuputsi ba tsoang Univesithing ea Stanford, Univesithi ea California Berkeley, ETH Zurich, Sekolo sa Graduate ea Economics, Univesithi ea Carnegie Mellon, hammoho le Yandex le Meta, ba hatisitse khoutu ea mohloli oa enjene bakeng sa ho tsamaisa mefuta e meholo ea lipuo ho lisebelisoa. - tsamaiso e thata. Ka mohlala, enjene e fana ka bokhoni ba ho etsa ts'ebetso e hopotsang ChatGPT le Copilot ka ho sebelisa mohlala oa OPT-175B o koetlisitsoeng pele, o koahelang li-parameter tse limilione tse likete tse 175, k'homphieutheng e tloaelehileng e nang le karete ea litšoantšo ea lipapali tsa NVIDIA RTX3090 e nang le 24GB ea memori ea video. Khoutu e ngotsoe ka Python, e sebelisa moralo oa PyTorch mme e ajoa tlasa laesense ea Apache 2.0.

E kenyelletsa mohlala oa script bakeng sa ho theha bots e u lumellang ho khoasolla e 'ngoe ea mefuta ea lipuo tse fumanehang phatlalatsa 'me hang-hang u qale ho buisana (mohlala, ka ho tsamaisa taelo "python apps/chatbot.py -model facebook/opt-30b — -percent 0 100 100 0 100 0”). E le motheo, ho reretsoe ho sebelisa mohlala o moholo oa puo o hatisitsoeng ke Facebook, o koetliselitsoeng ho bokella BookCorpus (libuka tse likete tse 10), CC-Stories, Pile (OpenSubtitles, Wikipedia, DM Mathematics, HackerNews, joalo-joalo), Pushshift. io (e ipapisitse le data ea Reddit) le CCNewsV2 (boemo ba litaba). Mohlala o akaretsa li-tokens tse ka bang limilione tse likete tse 180 (800 GB ea data). Matsatsi a 33 a ts'ebetso ea sehlopha le 992 NVIDIA A100 80GB GPUs a sebelisitsoe ho koetlisa mohlala.

Ha e sebelisa mofuta oa OPT-175B tsamaisong e nang le NVIDIA T4 GPU (16GB) e le 'ngoe, enjene ea FlexGen e bonts'itse ts'ebetso e fihlang makhetlo a 100 ka lebelo ho feta litharollo tse fanoeng pele, e etsa hore tšebeliso ea mefuta e meholo ea lipuo e be theko e tlase le ho e lumella ho tsoela pele. litsamaiso tse se nang li-accelerator tse inehetseng. Ka nako e ts'oanang, FlexGen e ka hola ho bapisa lipalo le li-GPU tse ngata. Ho fokotsa boholo ba mohlala, ho sebelisoa leano la ho hatella paramethara le mochini oa caching oa mohlala.

Hajoale, FlexGen e ts'ehetsa feela mefuta ea lipuo tsa OPT, empa nakong e tlang bahlahisi le bona ba ts'episa ho eketsa tšehetso bakeng sa BLOOM (li-parameter tse limilione tse likete tse 176, li tšehetsa lipuo tse 46 le lipuo tse 13 tsa lenaneo), CodeGen (e ka hlahisa khoutu ka lipuo tse 22 tsa lenaneo) le Mefuta ea GLM. Mohlala oa puisano le bot e thehiloeng ho FlexGen le mohlala oa OPT-30B:

Motho: Lebitso la thaba e telele ka ho fetisisa lefatšeng ke mang?

Mothusi: Everest.

Motho: Ke rera leeto la sehopotso sa rona. Ke lintho life tseo re ka li etsang?

Mothusi: Ho na le lintho tse ngata tseo u ka li etsang bakeng sa sehopotso sa hau. Pele, o ka bapala likarete. Ea bobeli, u ka tsamaea ho ea hahlaula. Ea boraro, u ka ea musiamong.

Source: opennet.ru

Eketsa ka tlhaloso