ืคืจื•ื™ืงื˜ RedPajama ืžืคืชื— ืžืขืจืš ื ืชื•ื ื™ื ืคืชื•ื— ืœืžืขืจื›ื•ืช ื‘ื™ื ื” ืžืœืื›ื•ืชื™ืช

ืคืจื•ื™ืงื˜ ืฉื™ืชื•ืคื™ ืฉืœ RedPajama ืžื•ืฆื’ ืœื™ืฆื™ืจืช ืžื•ื“ืœื™ื ืฉืœ ืœืžื™ื“ืช ืžื›ื•ื ื” ืคืชื•ื—ื” ื•ืชืฉื•ืžื•ืช ืื™ืžื•ืŸ ื ืœื•ื•ืช ืฉื ื™ืชืŸ ืœื”ืฉืชืžืฉ ื‘ื”ืŸ ืœื‘ื ื™ื™ืช ืขื•ื–ืจื™ื ืื™ื ื˜ืœื™ื’ื ื˜ื™ื™ื ืฉืžืชื—ืจื™ื ื‘ืžื•ืฆืจื™ื ืžืกื—ืจื™ื™ื ื›ื’ื•ืŸ ChatGPT. ืฆืคื•ื™ ื›ื™ ื ื•ื›ื—ื•ืชื ืฉืœ ื ืชื•ื ื™ ืงื•ื“ ืคืชื•ื— ื•ืžื•ื“ืœื™ื ืฉืœ ืฉืคื•ืช ื’ื“ื•ืœื™ื ืชืกื™ืจ ืืช ื”ืžื’ื‘ืœื•ืช ืฉืœ ืฆื•ื•ืชื™ื ืขืฆืžืื™ื™ื ื”ืขื•ืกืงื™ื ื‘ืžื—ืงืจ ื‘ืชื—ื•ื ืœืžื™ื“ืช ืžื›ื•ื ื”, ื•ืชืคืฉื˜ ืืช ื™ืฆื™ืจืช ืžืขืจื›ื•ืช ื“ื™ืืœื•ื’ ืžื™ื•ื—ื“ื•ืช. ืืจื’ื•ื ื™ื ื•ืงื”ื™ืœื•ืช ื›ืžื• Together, Ontocord.ai, ETH DS3Lab, Stanford CRFM, Hazy Research ื•-MILA Quรฉbec AI Institute ื”ืฆื˜ืจืคื• ืœืขื‘ื•ื“ื” ืขืœ ื”ืคืจื•ื™ืงื˜.

ื”ืฆืขื“ ื”ืจืืฉื•ืŸ ื”ื™ื” ืคืจืกื•ื ืžืขืจืš ื”ื ืชื•ื ื™ื RedPajama-Data-1T ื‘ื ืคื— 1.2 ื˜ืจื™ืœื™ื•ืŸ ืืกื™ืžื•ืŸ ืœืื™ืžื•ืŸ ืžื•ื“ืœื™ื ืฉืœ ืฉื™ื—ื”. ืขืจื›ืช RedPajama ืžืฉื›ืคืœืช ื ืชื•ื ื™ื ืžืžืงื•ืจื•ืช ืฆื™ื‘ื•ืจื™ื™ื ืฉืฉื™ืžืฉื• ืืช ืคื™ื™ืกื‘ื•ืง ืœื™ืฆื™ืจืช ืžื•ื“ืœ ื”-LAMA ืฉืœื” (ืกื”"ื› 1.25 ื˜ืจื™ืœื™ื•ืŸ ืืกื™ืžื•ื ื™ื), ืืš ืžืกื•ืคืงืช ื‘ืจื™ืฉื™ื•ืŸ ืคืชื•ื— ืฉืื™ื ื• ืžื’ื‘ื™ืœ ืืช ื”ื™ืงืฃ ื”ืฉื™ืžื•ืฉ (ื ืชื•ื ื™ LLaMA ื•ืžื•ื“ืœื™ื ืกื•ืคืงื• ืจืง ืœื—ื•ืงืจื™ื ืขืœ ื™ื“ื™ ืžื™ื•ื—ื“ื™ื ื‘ืงืฉื” ืœืฉื™ืžื•ืฉ ืœื ืžืกื—ืจื™). ื”ืกื˜ ืœื”ื•ืจื“ื” RedPajama-Data-1T ื”ื•ื 2.67 TB ื•ื›ื•ืœืœ ืžื™ื“ืข ืžื“ืคื™ ืื™ื ื˜ืจื ื˜ ืฉื ื•ืกืคื• ืœืื™ื ื“ืงืก Common Crawl, ืืจื›ื™ื•ื ื™ ื•ื™ืงื™ืคื“ื™ื”, ืงื•ื“ ืžืงื•ืจ ืž-GitHub, ืกืคืจื™ื ืฆื™ื‘ื•ืจื™ื™ื ืžืกืคืจื™ื™ืช ื’ื•ื˜ื ื‘ืจื’, ืžืืžืจื™ื ืžื“ืขื™ื™ื ืžืืจื›ื™ื•ืŸ ArXiv ื•ื“ื™ื•ื ื™ื ืขื Stack Overflow ื•ืฉืืจ Stack Overflow. ื”ื—ืœืคืช ืืชืจื™ื.

ืžื•ื“ืœื™ื ืžื•ื›ื ื™ื, ืฉื”ื•ื›ืฉืจื• ืขืœ ื‘ืกื™ืก ืžืขืจืš ื”ื ืชื•ื ื™ื ื”ืžื•ื›ื ื™ื ื•ืขื‘ืจื• ืื•ืคื˜ื™ืžื™ื–ืฆื™ื” ื‘ืืžืฆืขื•ืช ื“ื•ื’ืžืื•ืช ืžื•ื›ื ื•ืช ืฉืœ ื“ื™ืืœื•ื’ื™ื ื‘ืฆื•ืจื” ืฉืœ ื”ื•ืจืื”-ื‘ื™ืฆื•ืข ืžืคืจื•ื™ืงื˜ื™ Alpaca ื•-OpenChatKit, ืžืชื•ื›ื ื ื™ื ืœื”ื™ื•ื•ืฆืจ ื‘ืฉื‘ื•ืขื•ืช ื”ืงืจื•ื‘ื™ื. ื™ื•ื–ืžื•ืช ืžื•ื“ืœ ืฉืคื” ื“ื•ืžื•ืช ื›ื•ืœืœื•ืช ืืช ืคืจื•ื™ืงื˜ื™ ื”ืงื•ื“ ื”ืคืชื•ื— ื‘ื—ืœืงื LLaMA, Alpaca, Vicuna ื•-Koala, ื›ืžื• ื’ื ืืช ื™ื•ื–ืžื•ืช ื”ืงื•ื“ ื”ืคืชื•ื— ื”ืžืœืื•ืช Pythia, OpenChatKit, Open Assistant ื•ื“ื•ืœื™.

ื‘ื ื•ืกืฃ, ื™ืฉื ื ืžืกืคืจ ืคืจื•ื™ืงื˜ื™ื ื—ื“ืฉื™ื ื”ืงืฉื•ืจื™ื ืœืœืžื™ื“ืช ืžื›ื•ื ื”:

  • MiniGPT-4 - ืžืจื—ื™ื‘ ืฆ'ืื˜ื‘ื•ื˜ื™ื ืžืกื•ืจืชื™ื™ื ืœืฉื™ื—ื” ืขื ื™ื›ื•ืœื•ืช ืฉืœื•ืงื—ื•ืช ื‘ื—ืฉื‘ื•ืŸ ืžื™ื“ืข ื—ื–ื•ืชื™, ืžื” ืฉืžืืคืฉืจ ืœืš ืœื ืชื— ืชืžื•ื ื•ืช ื•ืœืงื—ืช ื‘ื—ืฉื‘ื•ืŸ ื˜ืงืกื˜ ื‘ื›ืชื‘ ื™ื“ ื‘ืชื”ืœื™ืš ื”ืื™ื ื˜ืจืืงืฆื™ื” ืขื ื”ืžืขืจื›ืช (ืœื“ื•ื’ืžื”, ืืชื” ื™ื›ื•ืœ ืœืฉืื•ืœ ืื™ื–ื” ืกื•ื’ ืฉืœ ืื•ื‘ื™ื™ืงื˜ ืžื•ืฆื’ ื‘ืชืžื•ื ื”, ื‘ืงืฉ ืžื”ื‘ื•ื˜ ืœื›ืชื•ื‘ ืกื™ืคื•ืจ ืขืœ ืกืžืš ืžื” ืฉืžื•ืฆื’ ื‘ืชืžื•ื ื”, ืื• ืขืœ ืกืžืš ืกืงื™ืฆื” ืกื›ืžื˜ื™ืช, ื‘ืงืฉ ืœื™ืฆื•ืจ ืืชืจ ืื™ื ื˜ืจื ื˜). ื”ืžื™ืžื•ืฉ ืฉืœ MiniGPT-4 ื ื›ืชื‘ ื‘-Python ื•ืžื•ืคืฅ ืชื—ืช ืจื™ืฉื™ื•ืŸ BSD.
  • ืคื™ื™ืกื‘ื•ืง ืคืจืกืžื” ืขืจื›ืช ื›ืœื™ื ื•ืžื•ื“ืœ ืœืžื™ื“ื” ืขืฆืžื™ืช (SSL, Self-Supervised Learning, ืื™ื ื• ืžืฉืชืžืฉ ื‘ืชื•ื•ื™ื•ืช ื•ื”ืขืจื•ืช ืฉื”ื•ื›ื ื• ืขืœ ื™ื“ื™ ืื“ื) DINOv2 ืžื•ื“ืœ ืจืื™ื™ืช ืžื›ื•ื ื” ื”ืžืชืื™ื ืœืคืชืจื•ืŸ ื‘ืขื™ื•ืช ืฉืœ ืขื™ื‘ื•ื“ ืžื™ื“ืข ื—ื–ื•ืชื™ ื›ืœืœื™ (ืกื™ื•ื•ื’ ืชืžื•ื ื”, ื—ื™ืœื•ืฅ ืžื™ื“ืข ืขืœ ืื•ื‘ื™ื™ืงื˜ื™ื ื‘ ืชืžื•ื ื•ืช, ื”ื‘ื ืช ืžื” ืงื•ืจื” ื‘ื•ื•ื™ื“ืื•) ื•ืžื ื™ืคื•ืœืฆื™ื•ืช ื‘ืจืžืช ื”ืคื™ืงืกืœื™ื (ื—ื™ื–ื•ื™ ืขื•ืžืง, ืคื™ืœื•ื—). ื”ื“ื’ื ืžืื•ืžืŸ ืขืœ ืื•ืกืฃ ืฉืœ 142 ืžื™ืœื™ื•ืŸ ืชืžื•ื ื•ืช. ื”ืžื™ืžื•ืฉ ื ื›ืชื‘ ื‘-Python ื•ืžื•ืคืฅ ืชื—ืช ืจื™ืฉื™ื•ืŸ Creative Commons Attribution-NonCommercial 4.0 ื”ืžืืคืฉืจ ืฉื™ืžื•ืฉ ืœื ืžืกื—ืจื™.
  • GPT4All ื”ื•ื ืขืจื›ืช ื›ืœื™ื ืœื”ืคืขืœื” ืžื”ื™ืจื” ืฉืœ ืฆ'ืื˜ื‘ื•ื˜ื™ื ืขืฆืžืื™ื™ื ืขืœ ื”ื—ื•ืžืจื” ืฉืœื”ื (ื”ื ืœื ื ื™ื’ืฉื™ื ืœืฉื™ืจื•ืชื™ื ื—ื™ืฆื•ื ื™ื™ื ื•ืžืฉืชืžืฉื™ื ื‘ืžืขื‘ื“ื™ื ืขื ืชืžื™ื›ื” ื‘-AVX2 ืœื‘ื™ืฆื•ืข). ื—ื™ื‘ื•ืจ ื“ื’ืžื™ ืฉืคื” ื’ื“ื•ืœื™ื ื”ืžื‘ื•ืกืกื™ื ืขืœ GPT-J ื•-LLaMa ื ืชืžืš. ื”ืงื•ื“ ื›ืชื•ื‘ ื‘-Python ื•ืžื•ืคืฅ ืชื—ืช ืจื™ืฉื™ื•ืŸ MIT.

ืžืงื•ืจ: OpenNet.ru

ื”ื•ืกืคืช ืชื’ื•ื‘ื”