ืžื” ืžื™ื•ื—ื“ ื‘Cloudera ื•ืื™ืš ืžื›ื™ื ื™ื ืื•ืชื”

ืฉื•ืง ื”ืžื—ืฉื•ื‘ ื”ืžื‘ื•ื–ืจ ื•ื”ื‘ื™ื’ ื“ืื˜ื”, ืœืคื™ ื ืชื•ื ื™ื ืกื˜ื˜ื™ืกื˜ื™ื™ื, ื’ื“ืœ ื‘-18-19% ื‘ืฉื ื”. ื”ืžืฉืžืขื•ืช ื”ื™ื ืฉื ื•ืฉื ื‘ื—ื™ืจืช ื”ืชื•ื›ื ื” ืœืžื˜ืจื•ืช ืืœื• ื ื•ืชืจ ืจืœื•ื•ื ื˜ื™. ื‘ืคื•ืกื˜ ื–ื”, ื ืชื—ื™ืœ ืžื“ื•ืข ื™ืฉ ืฆื•ืจืš ื‘ืžื—ืฉื•ื‘ ืžื‘ื•ื–ืจ, ื ืคืจื˜ ื™ื•ืชืจ ืขืœ ื‘ื—ื™ืจืช ืชื•ื›ื ื”, ื ื“ื‘ืจ ืขืœ ืฉื™ืžื•ืฉ ื‘-Hadoop ื‘ืืžืฆืขื•ืช Cloudera, ื•ืœื‘ืกื•ืฃ ื ื“ื‘ืจ ืขืœ ื‘ื—ื™ืจืช ื”ื—ื•ืžืจื” ื•ื›ื™ืฆื“ ื”ื™ื ืžืฉืคื™ืขื” ืขืœ ื”ื‘ื™ืฆื•ืขื™ื ื‘ื“ืจื›ื™ื ืฉื•ื ื•ืช.

ืžื” ืžื™ื•ื—ื“ ื‘Cloudera ื•ืื™ืš ืžื›ื™ื ื™ื ืื•ืชื”
ืžื“ื•ืข ื™ืฉ ืฆื•ืจืš ื‘ืžื—ืฉื•ื‘ ืžื‘ื•ื–ืจ ื‘ืขืกืงื™ื ืจื’ื™ืœื™ื? ื”ื›ืœ ื›ืืŸ ืคืฉื•ื˜ ื•ืžืกื•ื‘ืš ื‘ื• ื–ืžื ื™ืช. ืคืฉื•ื˜ โ€“ ื›ื™ ื‘ืจื•ื‘ ื”ืžืงืจื™ื ืื ื• ืžื‘ืฆืขื™ื ื—ื™ืฉื•ื‘ื™ื ืคืฉื•ื˜ื™ื ื™ื—ืกื™ืช ืœื™ื—ื™ื“ืช ืžื™ื“ืข. ื–ื” ืงืฉื” ื›ื™ ื™ืฉ ื”ืจื‘ื” ืžื™ื“ืข ื›ื–ื”. ื›ืœ ื›ืš ื”ืจื‘ื”. ื›ืชื•ืฆืื” ืžื›ืš, ื–ื” ื”ื›ืจื—ื™ ืœืขื‘ื“ ื˜ืจื”-ื‘ื™ื™ื˜ ืฉืœ ื ืชื•ื ื™ื ื‘-1000 ืฉืจืฉื•ืจื™ื. ืœืคื™ื›ืš, ืžืงืจื™ ื”ืฉื™ืžื•ืฉ ื”ื ืื•ื ื™ื‘ืจืกืœื™ื™ื ืœืžื“ื™: ื ื™ืชืŸ ืœื”ืฉืชืžืฉ ื‘ื—ื™ืฉื•ื‘ื™ื ื‘ื›ืœ ืžืงื•ื ื‘ื• ื™ืฉ ืฆื•ืจืš ืœืงื—ืช ื‘ื—ืฉื‘ื•ืŸ ืžืกืคืจ ืจื‘ ืฉืœ ืžื“ื“ื™ื ืขืœ ืžืขืจืš ื ืชื•ื ื™ื ื’ื“ื•ืœ ืขื•ื“ ื™ื•ืชืจ.

ืื—ืช ื”ื“ื•ื’ืžืื•ืช ื”ืื—ืจื•ื ื•ืช: ืจืฉืช ื”ืคื™ืฆืจื™ื•ืช ื“ื•ื“ื• ืคื™ืฆื” ืžื•ึผื’ื“ึธืจ ื‘ื”ืชื‘ืกืก ืขืœ ื ื™ืชื•ื— ืžืื’ืจ ื”ื”ื–ืžื ื•ืช ืฉืœ ื”ืœืงื•ื—ื•ืช, ืฉื›ืืฉืจ ื‘ื•ื—ืจื™ื ืคื™ืฆื” ืขื ืชื•ืกืคืช ืืงืจืื™ืช, ืžืฉืชืžืฉื™ื ื‘ื“ืจืš ื›ืœืœ ืคื•ืขืœื™ื ืขื ืฉื™ืฉื” ืกื˜ื™ื ื‘ืกื™ืกื™ื™ื ืฉืœ ืžืจื›ื™ื‘ื™ื ื•ืขื•ื“ ื›ืžื” ืืงืจืื™ื™ื. ื‘ื”ืชืื ืœื›ืš ื”ืชืื™ืžื” ื”ืคื™ืฆืจื™ื” ืืช ืงื ื™ื•ืชื™ื”. ื‘ื ื•ืกืฃ, ื”ื™ื ื”ืฆืœื™ื—ื” ืœื”ืžืœื™ืฅ โ€‹โ€‹ื˜ื•ื‘ ื™ื•ืชืจ ืขืœ ืžื•ืฆืจื™ื ื ื•ืกืคื™ื ืฉื”ื•ืฆืขื• ืœืžืฉืชืžืฉื™ื ื‘ืฉืœื‘ ื”ื”ื–ืžื ื”, ืžื” ืฉื”ื’ื“ื™ืœ ืืช ื”ืจื•ื•ื—ื™ื.

ื“ื•ื’ืžื” ื ื•ืกืคืช: ะฐะฝะฐะปะธะท ืคืจื™ื˜ื™ ื”ืžื•ืฆืจ ืืคืฉืจื• ืœื—ื ื•ืช H&M ืœืฆืžืฆื ืืช ื”ืžื‘ื—ืจ ื‘ื—ื ื•ื™ื•ืช ื‘ื•ื“ื“ื•ืช ื‘-40%, ืชื•ืš ืฉืžื™ืจื” ืขืœ ืจืžื•ืช ืžื›ื™ืจื•ืช. ื–ื” ื”ื•ืฉื’ ืขืœ ื™ื“ื™ ืื™ ื”ื›ืœืœืช ืคืจื™ื˜ื™ื ืฉื ืžื›ืจื• ื‘ืฆื•ืจื” ื’ืจื•ืขื”, ื•ื”ืขื•ื ืชื™ื•ืช ื ืœืงื—ื” ื‘ื—ืฉื‘ื•ืŸ ื‘ื—ื™ืฉื•ื‘ื™ื.

ื‘ื—ื™ืจืช ื›ืœื™

ื”ืชืงืŸ ื”ืชืขืฉื™ื™ื” ืขื‘ื•ืจ ืกื•ื’ ื–ื” ืฉืœ ืžื—ืฉื•ื‘ ื”ื•ื Hadoop. ืœืžื”? ื›ื™ Hadoop ื”ื™ื ืžืกื’ืจืช ืžืฆื•ื™ื ืช ื•ืžืชื•ืขื“ืช ื”ื™ื˜ื‘ (ืื•ืชื• Habr ืžืกืคืง ืžืืžืจื™ื ืžืคื•ืจื˜ื™ื ืจื‘ื™ื ื‘ื ื•ืฉื ื–ื”), ื”ืžืœื•ื•ื” ื‘ืžืขืจืš ืฉืœื ืฉืœ ื›ืœื™ ืฉื™ืจื•ืช ื•ืกืคืจื™ื•ืช. ืืชื” ื™ื›ื•ืœ ืœื”ื–ื™ืŸ ืกื˜ื™ื ืขืฆื•ืžื™ื ืฉืœ ื ืชื•ื ื™ื ืžื•ื‘ื ื™ื ื•ื’ื ืœื ืžื•ื‘ื ื™ื, ื•ื”ืžืขืจื›ืช ืขืฆืžื” ืชืคื™ืฅ ืื•ืชื ื‘ื™ืŸ ื›ื•ื— ื”ืžื—ืฉื•ื‘. ื™ืชืจ ืขืœ ื›ืŸ, ื ื™ืชืŸ ืœื”ื’ื“ื™ืœ ืื• ืœื”ืฉื‘ื™ืช ืืช ืื•ืชืŸ ื™ื›ื•ืœื•ืช ื‘ื›ืœ ืขืช - ืื•ืชื” ืžื“ืจื’ื™ื•ืช ืื•ืคืงื™ืช ื‘ืคืขื•ืœื”.

ื‘ืฉื ืช 2017, ื—ื‘ืจืช ื”ื™ื™ืขื•ืฅ ื”ืžืฉืคื™ืขื” ื’ืจื˜ื ืจ ืกื™ื›ืืฉ-Hadoop ืชืชื™ื™ืฉืŸ ื‘ืงืจื•ื‘. ื”ืกื™ื‘ื” ื‘ื ืืœื™ืช ืœืžื“ื™: ืื ืœื™ืกื˜ื™ื ืžืืžื™ื ื™ื ืฉื—ื‘ืจื•ืช ื™ืขื‘ืจื• ื‘ื”ืžื•ื ื™ื”ื ืœืขื ืŸ, ืฉื›ืŸ ืฉื ื™ื•ื›ืœื• ืœืฉืœื ืชื•ืš ืฉื™ืžื•ืฉ ื‘ื›ื•ื— ืžื—ืฉื•ื‘. ื”ื’ื•ืจื ื”ื—ืฉื•ื‘ ื”ืฉื ื™ ืฉื™ื›ื•ืœ ื›ื‘ื™ื›ื•ืœ "ืœืงื‘ื•ืจ" ืืช Hadoop ื”ื•ื ื”ืžื”ื™ืจื•ืช ืฉืœื•. ืžื›ื™ื•ื•ืŸ ืฉืืคืฉืจื•ื™ื•ืช ื›ืžื• Apache Spark ืื• Google Cloud DataFlow ืžื”ื™ืจื•ืช ื™ื•ืชืจ ืž- MapReduce, ืฉืขื•ืžื“ ื‘ื‘ืกื™ืก Hadoop.

Hadoop ื ืฉืขื ืช ืขืœ ื›ืžื” ืขืžื•ื“ื™ื, ื›ืฉื”ื‘ื•ืœื˜ื™ื ืฉื‘ื”ื ื”ื ื˜ื›ื ื•ืœื•ื’ื™ื•ืช MapReduce (ืžืขืจื›ืช ืœื”ืคืฆืช ื ืชื•ื ื™ื ืœื—ื™ืฉื•ื‘ื™ื ื‘ื™ืŸ ืฉืจืชื™ื) ื•ืžืขืจื›ืช ื”ืงื‘ืฆื™ื HDFS. ื”ืื—ืจื•ืŸ ืชื•ื›ื ืŸ ื‘ืžื™ื•ื—ื“ ืœืื—ืกื•ืŸ ืžื™ื“ืข ื”ืžื•ืคืฅ ื‘ื™ืŸ ืฆืžืชื™ื ืฉืœ ืืฉื›ื•ืœื•ืช: ื›ืœ ื‘ืœื•ืง ื‘ื’ื•ื“ืœ ืงื‘ื•ืข ื™ื›ื•ืœ ืœื”ื™ื•ืช ืžืžื•ืงื ืขืœ ืžืกืคืจ ืฆืžืชื™ื, ื•ื‘ื–ื›ื•ืช ืฉื›ืคื•ืœ, ื”ืžืขืจื›ืช ืขืžื™ื“ื” ื‘ืคื ื™ ื›ืฉืœื™ื ืฉืœ ืฆืžืชื™ื ื‘ื•ื“ื“ื™ื. ื‘ืžืงื•ื ื˜ื‘ืœืช ืงื‘ืฆื™ื, ื ืขืฉื” ืฉื™ืžื•ืฉ ื‘ืฉืจืช ืžื™ื•ื—ื“ ื‘ืฉื NameNode.

ื”ืื™ื•ืจ ืฉืœื”ืœืŸ ืžืจืื” ื›ื™ืฆื“ MapReduce ืคื•ืขืœ. ื‘ืฉืœื‘ ื”ืจืืฉื•ืŸ ืžื—ืœืงื™ื ืืช ื”ื ืชื•ื ื™ื ืœืคื™ ืงืจื™ื˜ืจื™ื•ืŸ ืžืกื•ื™ื, ื‘ืฉืœื‘ ื”ืฉื ื™ ืžื—ื•ืœืงื™ื ืœืคื™ ื›ื•ื— ืžื—ืฉื•ื‘ ื•ื‘ืฉืœื‘ ื”ืฉืœื™ืฉื™ ืžืชื‘ืฆืข ื”ื—ื™ืฉื•ื‘.

ืžื” ืžื™ื•ื—ื“ ื‘Cloudera ื•ืื™ืš ืžื›ื™ื ื™ื ืื•ืชื”
MapReduce ื ื•ืฆืจ ื‘ืžืงื•ืจ ืขืœ ื™ื“ื™ ื’ื•ื’ืœ ืœืฆืจื›ื™ ื”ื—ื™ืคื•ืฉ ืฉืœื”. ืœืื—ืจ ืžื›ืŸ, MapReduce ืขื‘ืจ ืงื•ื“ ื—ื™ื ื, ื•- Apache ื”ืฉืชืœื˜ ืขืœ ื”ืคืจื•ื™ืงื˜. ื•ื‘ื›ืŸ, ื’ื•ื’ืœ ืขื‘ืจื” ื‘ื”ื“ืจื’ื” ืœืคืชืจื•ื ื•ืช ืื—ืจื™ื. ืžื™ื“ืข ืžืขื ื™ื™ืŸ: ืœื’ื•ื’ืœ ื™ืฉ ื›ืจื’ืข ืคืจื•ื™ืงื˜ ื‘ืฉื Google Cloud Dataflow, ื”ืžื•ืฆื‘ ื›ืฉืœื‘ ื”ื‘ื ืื—ืจื™ Hadoop, ื›ืชื—ืœื™ืฃ ืžื”ื™ืจ ืขื‘ื•ืจื•.

ืžื‘ื˜ ืžืงืจื•ื‘ ืžืจืื” ืฉ-Google Cloud Dataflow ืžื‘ื•ืกืก ืขืœ ื•ืจื™ืืฆื™ื” ืฉืœ Apache Beam, ื‘ืขื•ื“ Apache Beam ื›ื•ืœืœ ืืช ื”ืžืกื’ืจืช ื”ืžืชื•ืขื“ืช ื”ื™ื˜ื‘ ืฉืœ Apache Spark, ื”ืžืืคืฉืจืช ืœื ื• ืœื“ื‘ืจ ืขืœ ืื•ืชื” ืžื”ื™ืจื•ืช ื‘ื™ืฆื•ืข ื›ืžืขื˜ ืฉืœ ืคืชืจื•ื ื•ืช. ื•ื‘ื›ืŸ, Apache Spark ืขื•ื‘ื“ ื‘ืฆื•ืจื” ืžื•ืฉืœืžืช ืขืœ ืžืขืจื›ืช ื”ืงื‘ืฆื™ื HDFS, ื”ืžืืคืฉืจืช ืœืคืจื•ืก ืื•ืชื” ืขืœ ืฉืจืชื™ Hadoop.

ื”ื•ืกืฃ ื›ืืŸ ืืช ื ืคื— ื”ืชื™ืขื•ื“ ื•ื”ืคืชืจื•ื ื•ืช ื”ืžื•ื›ื ื™ื ืขื‘ื•ืจ Hadoop ื•-Spark ืžื•ืœ Google Cloud Dataflow, ื•ื”ื‘ื—ื™ืจื” ื‘ื›ืœื™ ื”ื•ืคื›ืช ื‘ืจื•ืจื”. ื™ืชืจื” ืžื›ืš, ืžื”ื ื“ืกื™ื ื™ื›ื•ืœื™ื ืœื”ื—ืœื™ื˜ ื‘ืขืฆืžื ืื™ื–ื” ืงื•ื“ - ืขื‘ื•ืจ Hadoop ืื• Spark - ืขืœื™ื”ื ืœื”ืคืขื™ืœ, ืชื•ืš ื”ืชืžืงื“ื•ืช ื‘ืžืฉื™ืžื”, ื”ื ื™ืกื™ื•ืŸ ื•ื”ื›ื™ืฉื•ืจื™ื.

ืขื ืŸ ืื• ืฉืจืช ืžืงื•ืžื™

ื”ืžื’ืžื” ืœืžืขื‘ืจ ื›ืœืœื™ ืœืขื ืŸ ืืฃ ื”ื•ืœื™ื“ื” ืžื•ื ื— ืžืขื ื™ื™ืŸ ื›ืžื• Hadoop-as-a-service. ื‘ืชืจื—ื™ืฉ ื›ื–ื”, ื”ื ื™ื”ื•ืœ ืฉืœ ืฉืจืชื™ื ืžื—ื•ื‘ืจื™ื ื”ืคืš ื—ืฉื•ื‘ ืžืื•ื“. ื›ื™, ืœืžืจื‘ื” ื”ืฆืขืจ, ืœืžืจื•ืช ื”ืคื•ืคื•ืœืจื™ื•ืช ืฉืœื•, Hadoop ื˜ื”ื•ืจ ื”ื•ื ื›ืœื™ ืงืฉื” ืœืžื“ื™ ืœื”ื’ื“ืจื”, ืžื›ื™ื•ื•ืŸ ืฉืฆืจื™ืš ืœืขืฉื•ืช ื”ืจื‘ื” ื™ื“ื ื™ืช. ืœื“ื•ื’ืžื”, ืœื”ื’ื“ื™ืจ ืฉืจืชื™ื ื‘ื ืคืจื“, ืœืคืงื— ืขืœ ื”ื‘ื™ืฆื•ืขื™ื ืฉืœื”ื ื•ืœื”ื’ื“ื™ืจ ื‘ืงืคื™ื“ื” ืคืจืžื˜ืจื™ื ืจื‘ื™ื. ื‘ืื•ืคืŸ ื›ืœืœื™, ื”ืขื‘ื•ื“ื” ืžื™ื•ืขื“ืช ืœื—ื•ื‘ื‘ืŸ ื•ื™ืฉ ืกื™ื›ื•ื™ ื’ื“ื•ืœ ืœื”ืชื‘ืœื‘ืœ ืื™ืคืฉื”ื• ืื• ืœืคืกืคืก ืžืฉื”ื•.

ืœื›ืŸ, ืขืจื›ื•ืช ื”ืคืฆื” ืฉื•ื ื•ืช, ืืฉืจ ืžืฆื•ื™ื“ื•ืช ื‘ืชื—ื™ืœื” ื‘ื›ืœื™ ืคืจื™ืกื” ื•ื ื™ื”ื•ืœ ื ื•ื—ื™ื, ื”ืคื›ื• ืคื•ืคื•ืœืจื™ื•ืช ืžืื•ื“. ืื—ืช ื”ื”ืคืฆื•ืช ื”ืคื•ืคื•ืœืจื™ื•ืช ื‘ื™ื•ืชืจ ืฉืชื•ืžื›ื•ืช ื‘ืกืคืืจืง ื•ืžืงืœื•ืช ืขืœ ื”ื›ืœ ื”ื™ื Cloudera. ื™ืฉ ืœื• ื’ื ื’ืจืกืื•ืช ื‘ืชืฉืœื•ื ื•ื’ื ื—ื™ื ืžื™ื•ืช - ื•ื‘ืื—ืจื•ืŸ ื›ืœ ื”ืคื•ื ืงืฆื™ื•ื ืœื™ื•ืช ื”ื‘ืกื™ืกื™ืช ื–ืžื™ื ื”, ืžื‘ืœื™ ืœื”ื’ื‘ื™ืœ ืืช ืžืกืคืจ ื”ืฆืžืชื™ื.

ืžื” ืžื™ื•ื—ื“ ื‘Cloudera ื•ืื™ืš ืžื›ื™ื ื™ื ืื•ืชื”

ื‘ืžื”ืœืš ื”ื”ื’ื“ืจื”, Cloudera Manager ื™ืชื—ื‘ืจ ื‘ืืžืฆืขื•ืช SSH ืœืฉืจืชื™ื ืฉืœืš. ื ืงื•ื“ื” ืžืขื ื™ื™ื ืช: ื‘ืขืช ื”ื”ืชืงื ื”, ืขื“ื™ืฃ ืœืฆื™ื™ืŸ ืฉื–ื” ื™ื‘ื•ืฆืข ืขืœ ื™ื“ื™ ืžื” ืฉื ืงืจื ืคื˜ืจื™ื•ืช: ื—ื‘ื™ืœื•ืช ืžื™ื•ื—ื“ื•ืช, ืฉื›ืœ ืื—ืช ืžื”ืŸ ืžื›ื™ืœื” ืืช ื›ืœ ื”ืจื›ื™ื‘ื™ื ื”ื“ืจื•ืฉื™ื ื”ืžื•ื’ื“ืจื™ื ืœืขื‘ื•ื“ ื–ื” ืขื ื–ื”. ื‘ืขื™ืงืจื•ืŸ ื–ื•ื”ื™ ื’ืจืกื” ืžืฉื•ืคืจืช ืฉืœ ืžื ื”ืœ ื”ื—ื‘ื™ืœื•ืช.

ืœืื—ืจ ื”ื”ืชืงื ื”, ืื ื• ืžืงื‘ืœื™ื ืžืกื•ืฃ ื ื™ื”ื•ืœ ืืฉื›ื•ืœื•ืช, ืฉื‘ื• ืืชื” ื™ื›ื•ืœ ืœืจืื•ืช ื˜ืœืžื˜ืจื™ื™ืช ืืฉื›ื•ืœื•ืช, ืฉื™ืจื•ืชื™ื ืžื•ืชืงื ื™ื, ื‘ื ื•ืกืฃ ืชื•ื›ืœ ืœื”ื•ืกื™ืฃ/ืœื”ืกื™ืจ ืžืฉืื‘ื™ื ื•ืœืขืจื•ืš ืืช ืชืฆื•ืจืช ื”ืืฉื›ื•ืœื•ืช.

ืžื” ืžื™ื•ื—ื“ ื‘Cloudera ื•ืื™ืš ืžื›ื™ื ื™ื ืื•ืชื”

ื›ืชื•ืฆืื” ืžื›ืš, ืชื ื”ื˜ื™ืœ ืฉื™ื™ืงื— ืืชื›ื ืœืขืชื™ื“ ื”ื–ื•ื”ืจ ืฉืœ BigData ืžื•ืคื™ืข ืœืคื ื™ื›ื. ืื‘ืœ ืœืคื ื™ ืฉื ืืžืจ "ื‘ื•ื ื ืœืš", ื‘ื•ืื• ื ืขื‘ื•ืจ ืžืชื—ืช ืœืžื›ืกื” ื”ืžื ื•ืข.

ื“ืจื™ืฉื•ืช ื—ื•ืžืจื”

ื‘ืืชืจ ื”ืื™ื ื˜ืจื ื˜ ืฉืœื”, Cloudera ืžื–ื›ื™ืจื” ืชืฆื•ืจื•ืช ืืคืฉืจื™ื•ืช ืฉื•ื ื•ืช. ื”ืขืงืจื•ื ื•ืช ื”ื›ืœืœื™ื™ื ืฉืœืคื™ื”ื ื”ื ื‘ื ื•ื™ื™ื ืžื•ืฆื’ื™ื ื‘ืื™ื•ืจ:

ืžื” ืžื™ื•ื—ื“ ื‘Cloudera ื•ืื™ืš ืžื›ื™ื ื™ื ืื•ืชื”
MapReduce ื™ื›ื•ืœ ืœื˜ืฉื˜ืฉ ืืช ื”ืชืžื•ื ื” ื”ืื•ืคื˜ื™ืžื™ืช ื”ื–ื•. ืื ืžืกืชื›ืœื™ื ืฉื•ื‘ ืขืœ ื”ื“ื™ืื’ืจืžื” ืžื”ืกืขื™ืฃ ื”ืงื•ื“ื, ืžืชื‘ืจืจ ืฉื›ืžืขื˜ ื‘ื›ืœ ื”ืžืงืจื™ื, ืขื‘ื•ื“ืช MapReduce ื™ื›ื•ืœื” ืœื”ื™ืชืงืœ ื‘ืฆื•ื•ืืจ ื‘ืงื‘ื•ืง ื‘ืขืช ืงืจื™ืืช ื ืชื•ื ื™ื ืžื”ื“ื™ืกืง ืื• ืžื”ืจืฉืช. ื–ื” ืžืฆื•ื™ืŸ ื’ื ื‘ื‘ืœื•ื’ Cloudera. ื›ืชื•ืฆืื” ืžื›ืš, ืขื‘ื•ืจ ื›ืœ ื—ื™ืฉื•ื‘ ืžื”ื™ืจ, ื›ื•ืœืœ ื“ืจืš Spark, ื”ืžืฉืžืฉืช ืœืขืชื™ื ืงืจื•ื‘ื•ืช ืœื—ื™ืฉื•ื‘ื™ื ื‘ื–ืžืŸ ืืžืช, ืžื”ื™ืจื•ืช ื”ืงืœื˜/ืคืœื˜ ื—ืฉื•ื‘ื” ืžืื•ื“. ืœื›ืŸ, ื‘ืฉื™ืžื•ืฉ ื‘-Hadoop, ื—ืฉื•ื‘ ืžืื•ื“ ืฉื”ืืฉื›ื•ืœ ื™ื›ืœื•ืœ ืžื›ื•ื ื•ืช ืžืื•ื–ื ื•ืช ื•ืžื”ื™ืจื•ืช, ืฉื‘ืœืฉื•ืŸ ื”ืžืขื˜ื”, ืœื ืชืžื™ื“ ืžื•ื‘ื˜ื— ื‘ืชืฉืชื™ืช ื”ืขื ืŸ.

ืื™ื–ื•ืŸ ื‘ื—ืœื•ืงืช ื”ืขื•ืžืก ืžื•ืฉื’ ื‘ืืžืฆืขื•ืช ืฉื™ืžื•ืฉ ื‘ื•ื•ื™ืจื˜ื•ืืœื™ื–ืฆื™ื” ืฉืœ Openstack ื‘ืฉืจืชื™ื ืขื ืžืขื‘ื“ื™ื ืจื‘ื™-ืœื™ื‘ื•ืช ืจื‘ื™ ืขื•ืฆืžื”. ืœืฆืžืชื™ ื ืชื•ื ื™ื ืžื•ืงืฆื™ื ืžืฉืื‘ื™ ืžืขื‘ื“ ืžืฉืœื”ื ื•ื“ื™ืกืงื™ื ืกืคืฆื™ืคื™ื™ื. ื‘ื”ื—ืœื˜ื” ืฉืœื ื• Atos Codex Data Lake Engine ืžื•ืฉื’ืช ื•ื™ืจื˜ื•ืืœื™ื–ืฆื™ื” ืจื—ื‘ื”, ื•ื–ื• ื”ืกื™ื‘ื” ืฉืื ื• ืžืจื•ื•ื™ื—ื™ื ื”ืŸ ืžื‘ื—ื™ื ืช ื‘ื™ืฆื•ืขื™ื (ื”ืฉืคืขืช ืชืฉืชื™ืช ื”ืจืฉืช ืžืžื•ื–ืขืจืช) ื•ื”ืŸ ื‘-TCO (ืžืชื‘ื˜ืœื™ื ืฉืจืชื™ื ืคื™ื–ื™ื™ื ื ื•ืกืคื™ื).

ืžื” ืžื™ื•ื—ื“ ื‘Cloudera ื•ืื™ืš ืžื›ื™ื ื™ื ืื•ืชื”
ื‘ืขืช ืฉื™ืžื•ืฉ ื‘ืฉืจืชื™ BullSequana S200, ืื ื• ืžืงื‘ืœื™ื ืขื•ืžืก ืื—ื™ื“ ืžืื•ื“, ื ื˜ื•ืœ ื›ืžื” ืฆื•ื•ืืจื™ ื‘ืงื‘ื•ืง. ื”ืชืฆื•ืจื” ื”ืžื™ื ื™ืžืœื™ืช ื›ื•ืœืœืช 3 ืฉืจืชื™ BullSequana S200, ื›ืœ ืื—ื“ ืขื ืฉื ื™ JBODs, ื‘ืชื•ืกืคืช S200s ื ื•ืกืคื™ื ื”ืžื›ื™ืœื™ื ืืจื‘ืขื” ืฆืžืชื™ ื ืชื•ื ื™ื ืžื—ื•ื‘ืจื™ื ื‘ืื•ืคืŸ ืื•ืคืฆื™ื•ื ืœื™. ื”ื ื” ื“ื•ื’ืžื” ืœืขื•ืžืก ื‘ืžื‘ื—ืŸ TeraGen:

ืžื” ืžื™ื•ื—ื“ ื‘Cloudera ื•ืื™ืš ืžื›ื™ื ื™ื ืื•ืชื”

ื‘ื“ื™ืงื•ืช ืขื ื ืคื—ื™ ื ืชื•ื ื™ื ื•ืขืจื›ื™ ืฉื›ืคื•ืœ ืฉื•ื ื™ื ืžืฆื™ื’ื•ืช ืืช ืื•ืชืŸ ืชื•ืฆืื•ืช ื‘ืžื•ื ื—ื™ื ืฉืœ ื—ืœื•ืงืช ืขื•ืžืกื™ื ื‘ื™ืŸ ืฆืžืชื™ ืืฉื›ื•ืœ. ืœื”ืœืŸ ื’ืจืฃ ืฉืœ ื”ืชืคืœื’ื•ืช ื”ื’ื™ืฉื” ืœื“ื™ืกืง ืœืคื™ ืžื‘ื—ื ื™ ื‘ื™ืฆื•ืขื™ื.

ืžื” ืžื™ื•ื—ื“ ื‘Cloudera ื•ืื™ืš ืžื›ื™ื ื™ื ืื•ืชื”

ื—ื™ืฉื•ื‘ื™ื ื‘ื•ืฆืขื• ืขืœ ืกืžืš ืชืฆื•ืจื” ืžื™ื ื™ืžืœื™ืช ืฉืœ 3 ืฉืจืชื™ BullSequana S200. ื”ื•ื ื›ื•ืœืœ 9 ืฆืžืชื™ ื ืชื•ื ื™ื ื•-3 ืฆืžืชื™ื ืžืืกื˜ืจ, ื›ืžื• ื’ื ืžื›ื•ื ื•ืช ื•ื™ืจื˜ื•ืืœื™ื•ืช ืฉืžื•ืจื•ืช ื‘ืžืงืจื” ืฉืœ ืคืจื™ืกืช ื”ื’ื ื” ื”ืžื‘ื•ืกืกืช ืขืœ OpenStack Virtualization. ืชื•ืฆืืช ื‘ื“ื™ืงืช TeraSort: ื’ื•ื“ืœ ื‘ืœื•ืง 512 MB ืžืงื“ื ืฉื›ืคื•ืœ ืฉื•ื•ื” ืœืฉืœื•ืฉ ืขื ื”ืฆืคื ื” ื”ื•ื 23,1 ื“ืงื•ืช.

ื›ื™ืฆื“ ื ื™ืชืŸ ืœื”ืจื—ื™ื‘ ืืช ื”ืžืขืจื›ืช? ื™ืฉื ื ืกื•ื’ื™ื ืฉื•ื ื™ื ืฉืœ ื”ืจื—ื‘ื•ืช ื–ืžื™ื ื•ืช ืขื‘ื•ืจ Data Lake Engine:

  • ืฆืžืชื™ ื ืชื•ื ื™ื: ืขื‘ื•ืจ ื›ืœ 40 TB ืฉืœ ืฉื˜ื— ืฉืžื™ืฉ
  • ืฆืžืชื™ื ืื ืœื™ื˜ื™ื™ื ืขื ื™ื›ื•ืœืช ืœื”ืชืงื™ืŸ GPU
  • ืืคืฉืจื•ื™ื•ืช ืื—ืจื•ืช ื‘ื”ืชืื ืœืฆืจื›ื™ื ื”ืขืกืงื™ื™ื (ืœื“ื•ื’ืžื”, ืื ืืชื” ืฆืจื™ืš ืงืคืงื ื•ื›ื“ื•ืžื”)

ืžื” ืžื™ื•ื—ื“ ื‘Cloudera ื•ืื™ืš ืžื›ื™ื ื™ื ืื•ืชื”

Atos Codex Data Lake Engine ื›ื•ืœืœ ื’ื ืืช ื”ืฉืจืชื™ื ืขืฆืžื ื•ื’ื ืชื•ื›ื ื•ืช ืžื•ืชืงื ื•ืช ืžืจืืฉ, ื›ื•ืœืœ ืขืจื›ืช Cloudera ืžื•ืจืฉื™ืช; Hadoop ืขืฆืžื”, OpenStack ืขื ืžื›ื•ื ื•ืช ื•ื™ืจื˜ื•ืืœื™ื•ืช ื”ืžื‘ื•ืกืกื•ืช ืขืœ ืœื™ื‘ืช RedHat Enterprise Linux, ืžืขืจื›ื•ืช ืฉื›ืคื•ืœ ื•ื’ื™ื‘ื•ื™ ื ืชื•ื ื™ื (ื›ื•ืœืœ ืฉื™ืžื•ืฉ ื‘ืฆื•ืžืช ื’ื™ื‘ื•ื™ ื•-Cloudera BDR - Backup and Disaster Recovery). Atos Codex Data Lake Engine ื”ืคืš ืœืคืชืจื•ืŸ ื”ื•ื™ืจื˜ื•ืืœื™ื–ืฆื™ื” ื”ืจืืฉื•ืŸ ืฉืงื™ื‘ืœ ื”ืกืžื›ื” ืงืœื•ื“ืจื”.

ืื ืืชื ืžืขื•ื ื™ื™ื ื™ื ื‘ืคืจื˜ื™ื, ื ืฉืžื— ืœืขื ื•ืช ืขืœ ืฉืืœื•ืชื™ื ื• ื‘ืชื’ื•ื‘ื•ืช.

ืžืงื•ืจ: www.habr.com

ื”ื•ืกืคืช ืชื’ื•ื‘ื”