ื›ื™ืฆื“ BigQuery ืฉืœ Google ื‘ื™ืฆืขื” ื“ืžื•ืงืจื˜ื™ื” ืฉืœ ื ื™ืชื•ื— ื ืชื•ื ื™ื. ื—ืœืง 1

ืฉืœื•ื, ื”ื‘ืจ! ื”ื”ืจืฉืžื” ืœื–ืจื ืงื•ืจืก ื—ื“ืฉ ืคืชื•ื—ื” ื›ืขืช ื‘-OTUS ืžื”ื ื“ืก ื ืชื•ื ื™ื. ืœืงืจืืช ืชื—ื™ืœืช ื”ืงื•ืจืก, ื”ื›ื ื• ืขื‘ื•ืจื›ื ื‘ืื•ืคืŸ ืžืกื•ืจืชื™ ืชืจื’ื•ื ืฉืœ ื—ื•ืžืจ ืžืขื ื™ื™ืŸ.

ืžื“ื™ ื™ื•ื, ื™ื•ืชืจ ืžืžืื” ืžื™ืœื™ื•ืŸ ืื ืฉื™ื ืžื‘ืงืจื™ื ื‘ื˜ื•ื•ื™ื˜ืจ ื›ื“ื™ ืœื’ืœื•ืช ืžื” ืงื•ืจื” ื‘ืขื•ืœื ื•ืœื“ื•ืŸ ื‘ื–ื”. ื›ืœ ืฆื™ื•ืฅ ื•ื›ืœ ืคืขื•ืœืช ืžืฉืชืžืฉ ืื—ืจืช ืžื™ื™ืฆืจื™ื ืื™ืจื•ืข ืฉื–ืžื™ืŸ ืœื ื™ืชื•ื— ื”ื ืชื•ื ื™ื ื”ืคื ื™ืžื™ ืฉืœ ื˜ื•ื•ื™ื˜ืจ. ืžืื•ืช ืขื•ื‘ื“ื™ื ืžื ืชื—ื™ื ื•ืžื“ืžื™ื™ื ื™ื ื ืชื•ื ื™ื ืืœื”, ื•ืฉื™ืคื•ืจ ื”ื—ื•ื•ื™ื” ืฉืœื”ื ื”ื•ื ื‘ืจืืฉ ืกื“ืจ ื”ืขื“ื™ืคื•ื™ื•ืช ืฉืœ ืฆื•ื•ืช ืคืœื˜ืคื•ืจืžืช ื”ื ืชื•ื ื™ื ืฉืœ ื˜ื•ื•ื™ื˜ืจ.

ืื ื• ืžืืžื™ื ื™ื ืฉืžืฉืชืžืฉื™ื ื‘ืขืœื™ ืžื’ื•ื•ืŸ ืจื—ื‘ ืฉืœ ื›ื™ืฉื•ืจื™ื ื˜ื›ื ื™ื™ื ืฆืจื™ื›ื™ื ืœื”ื™ื•ืช ืžืกื•ื’ืœื™ื ืœื’ืœื•ืช ื ืชื•ื ื™ื ื•ืœืงื‘ืœ ื’ื™ืฉื” ืœื›ืœื™ ื ื™ืชื•ื— ื•ื”ื“ืžื™ื” ืžื‘ื•ืกืกื™ SQL ื‘ืขืœื™ ื‘ื™ืฆื•ืขื™ื ื˜ื•ื‘ื™ื. ื–ื” ื™ืืคืฉืจ ืœืงื‘ื•ืฆื” ื—ื“ืฉื” ืœื’ืžืจื™ ืฉืœ ืžืฉืชืžืฉื™ื ืคื—ื•ืช ื˜ื›ื ื™ื™ื, ื›ื•ืœืœ ืžื ืชื—ื™ ื ืชื•ื ื™ื ื•ืžื ื”ืœื™ ืžื•ืฆืจ, ืœื—ืœืฅ ืชื•ื‘ื ื•ืช ืžื ืชื•ื ื™ื, ืžื” ืฉื™ืืคืฉืจ ืœื”ื ืœื”ื‘ื™ืŸ ื˜ื•ื‘ ื™ื•ืชืจ ืืช ื”ื™ื›ื•ืœื•ืช ืฉืœ ื˜ื•ื•ื™ื˜ืจ ื•ืœื”ืฉืชืžืฉ ื‘ื”ืŸ. ื›ืš ืื ื• ืขื•ืฉื™ื ื“ืžื•ืงืจื˜ื™ื–ืฆื™ื” ืฉืœ ื ื™ืชื•ื— ื ืชื•ื ื™ื ื‘ื˜ื•ื•ื™ื˜ืจ.

ื›ื›ืœ ืฉื”ื›ืœื™ื ืฉืœื ื• ื•ื™ื›ื•ืœื•ืช ื ื™ืชื•ื— ื”ื ืชื•ื ื™ื ื”ืคื ื™ืžื™ื•ืช ืฉืœื ื• ื”ืฉืชืคืจื•, ืจืื™ื ื• ืืช ื˜ื•ื•ื™ื˜ืจ ืžืฉืชืคืจ. ืขื ื–ืืช, ืขื“ื™ื™ืŸ ื™ืฉ ืžืงื•ื ืœืฉื™ืคื•ืจ. ื›ืœื™ื ื ื•ื›ื—ื™ื™ื ื›ืžื• Scalding ื“ื•ืจืฉื™ื ื ื™ืกื™ื•ืŸ ื‘ืชื›ื ื•ืช. ืœื›ืœื™ ื ื™ืชื•ื— ืžื‘ื•ืกืกื™ SQL ื›ื’ื•ืŸ Presto ื•-Vertica ื™ืฉ ื‘ืขื™ื•ืช ื‘ื™ืฆื•ืขื™ื ื‘ืงื ื” ืžื™ื“ื”. ื™ืฉ ืœื ื• ื’ื ื‘ืขื™ื” ืฉืœ ื”ืคืฆืช ื ืชื•ื ื™ื ืขืœ ืคื ื™ ืžืกืคืจ ืžืขืจื›ื•ืช ืœืœื ื’ื™ืฉื” ืžืชืžื“ืช ืืœื™ื”ื.

ื‘ืฉื ื” ืฉืขื‘ืจื” ื”ื•ื“ืขื ื• ืฉื™ืชื•ืฃ ืคืขื•ืœื” ื—ื“ืฉ ืขื ื’ื•ื’ืœ, ืฉื‘ืชื•ื›ื• ืื ื• ืžืขื‘ื™ืจื™ื ื—ืœืงื™ื ืฉืœื ื• ืชืฉืชื™ืช ื ืชื•ื ื™ื ื‘-Google Cloud Platform (GCP). ื”ื’ืขื ื• ืœืžืกืงื ื” ืฉื›ืœื™ ื”ืขื ืŸ ืฉืœ Google ื ืชื•ื ื™ื ื’ื“ื•ืœื™ื ื™ื›ื•ืœื™ื ืœืขื–ื•ืจ ืœื ื• ืขื ื”ื™ื•ื–ืžื•ืช ืฉืœื ื• ืœื“ืžื•ืงืจื˜ื™ื–ืฆื™ื” ืฉืœ ื ื™ืชื•ื—, ื”ื“ืžื™ื” ื•ืœืžื™ื“ืช ืžื›ื•ื ื” ื‘ื˜ื•ื•ื™ื˜ืจ:

  • BigQuery: ืžื—ืกืŸ ื ืชื•ื ื™ื ืืจื’ื•ื ื™ ืขื ืžื ื•ืข SQL ืžื‘ื•ืกืก ื“ืจืืžืœ, ืืฉืจ ืžืคื•ืจืกืžืช ื‘ื–ื›ื•ืช ื”ืžื”ื™ืจื•ืช, ื”ืคืฉื˜ื•ืช ื•ื”ื”ืชืžื•ื“ื“ื•ืช ืฉืœื” ืœืžื™ื“ืช ืžื›ื•ื ื”.
  • Data Studio: ื›ืœื™ ืœื”ื“ืžื™ื” ืฉืœ ื ืชื•ื ื™ื ื’ื“ื•ืœื™ื ืขื ืชื›ื•ื ื•ืช ืฉื™ืชื•ืฃ ืคืขื•ืœื” ื“ืžื•ื™ื•ืช Google Docs.

ื‘ืžืืžืจ ื–ื” ืชืœืžื“ื• ืขืœ ื”ื ื™ืกื™ื•ืŸ ืฉืœื ื• ืขื ื”ื›ืœื™ื ื”ืืœื”: ืžื” ืขืฉื™ื ื•, ืžื” ืœืžื“ื ื• ื•ืžื” ื ืขืฉื” ื‘ื”ืžืฉืš. ื›ืขืช ื ืชืžืงื“ ื‘ื ื™ืชื•ื— ืืฆื•ื•ื” ื•ืื™ื ื˜ืจืืงื˜ื™ื‘ื™. ื ื“ื•ืŸ ื‘ื ื™ืชื•ื— ื‘ื–ืžืŸ ืืžืช ื‘ืžืืžืจ ื”ื‘ื.

ื”ื™ืกื˜ื•ืจื™ื” ืฉืœ ื—ื ื•ื™ื•ืช ื”ื ืชื•ื ื™ื ืฉืœ ื˜ื•ื•ื™ื˜ืจ

ืœืคื ื™ ืฉืฆื•ืœืœ ืœืชื•ืš BigQuery, ื›ื“ืื™ ืœืกืคืจ ื‘ืงืฆืจื” ืืช ื”ื”ื™ืกื˜ื•ืจื™ื” ืฉืœ ืื—ืกื•ืŸ ื”ื ืชื•ื ื™ื ืฉืœ ื˜ื•ื•ื™ื˜ืจ. ื‘ืฉื ืช 2011, ื‘ื•ืฆืข ื ื™ืชื•ื— ื ืชื•ื ื™ื ื‘ื˜ื•ื•ื™ื˜ืจ ื‘-Vertica ื•-Hadoop. ื”ืฉืชืžืฉื ื• ื‘-Pig ื›ื“ื™ ืœื™ืฆื•ืจ ืžืฉืจื•ืช MapReduce Hadoop. ื‘-2012 ื”ื—ืœืคื ื• ืืช Pig ื‘-Scalding, ืฉื”ื™ื” ืœื• Scala API ืขื ื™ืชืจื•ื ื•ืช ื›ืžื• ื”ื™ื›ื•ืœืช ืœื™ืฆื•ืจ ืฆื™ื ื•ืจื•ืช ืžื•ืจื›ื‘ื™ื ื•ืงืœื•ืช ื‘ื“ื™ืงื”. ืขื ื–ืืช, ืขื‘ื•ืจ ืžื ืชื—ื™ ื ืชื•ื ื™ื ื•ืžื ื”ืœื™ ืžื•ืฆืจ ืจื‘ื™ื ืฉื”ื™ื” ืœื”ื ื ื•ื— ื™ื•ืชืจ ืœืขื‘ื•ื“ ืขื SQL, ื–ื• ื”ื™ื™ืชื” ืขืงื•ืžืช ืœืžื™ื“ื” ืชืœื•ืœื” ืœืžื“ื™. ื‘ืกื‘ื™ื‘ื•ืช 2016, ื”ืชื—ืœื ื• ืœื”ืฉืชืžืฉ ื‘-Presto ื›ืžืžืฉืง SQL ืœื ืชื•ื ื™ Hadoop. Spark ื”ืฆื™ืขื” ืžืžืฉืง Python, ืžื” ืฉื”ื•ืคืš ืื•ืชื• ืœื‘ื—ื™ืจื” ื˜ื•ื‘ื” ืขื‘ื•ืจ ืžื“ืขื™ ื ืชื•ื ื™ื ืื“-ื”ื•ืง ื•ืœืžื™ื“ืช ืžื›ื•ื ื”.

ืžืื– 2018, ื”ืฉืชืžืฉื ื• ื‘ื›ืœื™ื ื”ื‘ืื™ื ืœื ื™ืชื•ื— ื ืชื•ื ื™ื ื•ื”ื“ืžื™ื”:

  • ืฆืจื™ื‘ื” ืขื‘ื•ืจ ืžืกื•ืขื™ ื™ื™ืฆื•ืจ
  • Scalding ื•-Spark ืœื ื™ืชื•ื— ื ืชื•ื ื™ื ืื“-ื”ื•ืง ื•ืœืžื™ื“ืช ืžื›ื•ื ื”
  • Vertica ื•-Presto ืœื ื™ืชื•ื— SQL ืื“-ื”ื•ืง ื•ืื™ื ื˜ืจืืงื˜ื™ื‘ื™
  • ื“ืจื•ืื™ื“ ืขื‘ื•ืจ ื’ื™ืฉื” ืื™ื ื˜ืจืืงื˜ื™ื‘ื™ืช ื ืžื•ื›ื”, ื—ืงื™ืจืช ื•ื”ืฉื”ื™ื™ื” ื ืžื•ื›ื” ืœืžื“ื™ื“ื™ ืกื“ืจื•ืช ื–ืžืŸ
  • Tableau, Zeppelin ื•- Pivot ืœื”ื“ืžื™ื™ืช ื ืชื•ื ื™ื

ื’ื™ืœื™ื ื• ืฉื‘ืขื•ื“ ืฉื”ื›ืœื™ื ื”ืœืœื• ืžืฆื™ืขื™ื ื™ื›ื•ืœื•ืช ื—ื–ืงื•ืช ืžืื•ื“, ื”ืชืงืฉื™ื ื• ืœื”ืคื•ืš ืืช ื”ื™ื›ื•ืœื•ืช ื”ืœืœื• ืœื–ืžื™ื ื•ืช ืœืงื”ืœ ืจื—ื‘ ื™ื•ืชืจ ื‘ื˜ื•ื•ื™ื˜ืจ. ืขืœ ื™ื“ื™ ื”ืจื—ื‘ืช ื”ืคืœื˜ืคื•ืจืžื” ืฉืœื ื• ืขื Google Cloud, ืื ื• ืžืชืžืงื“ื™ื ื‘ืคื™ืฉื•ื˜ ื›ืœื™ ื”ื ื™ืชื•ื— ืฉืœื ื• ืขื‘ื•ืจ ื›ืœ ื˜ื•ื•ื™ื˜ืจ.

ืžื—ืกืŸ ื”ื ืชื•ื ื™ื BigQuery ืฉืœ ื’ื•ื’ืœ

ื›ืžื” ืฆื•ื•ืชื™ื ื‘ื˜ื•ื•ื™ื˜ืจ ื›ื‘ืจ ืฉื™ืœื‘ื• ืืช BigQuery ื‘ื—ืœืง ืžืฆื™ื ื•ืจื•ืช ื”ื™ื™ืฆื•ืจ ืฉืœื”ื. ื‘ืขื–ืจืช ื”ืžื•ืžื—ื™ื•ืช ืฉืœื”ื, ื”ืชื—ืœื ื• ืœื”ืขืจื™ืš ืืช ื”ื™ื›ื•ืœื•ืช ืฉืœ BigQuery ืขื‘ื•ืจ ื›ืœ ืžืงืจื™ ื”ืฉื™ืžื•ืฉ ื‘ื˜ื•ื•ื™ื˜ืจ. ื”ืžื˜ืจื” ืฉืœื ื• ื”ื™ื™ืชื” ืœื”ืฆื™ืข ืืช BigQuery ืœื›ืœ ื”ื—ื‘ืจื” ื•ืœืชืงืŸ ื•ืœืชืžื•ืš ื‘ื” ื‘ืชื•ืš ืขืจื›ืช ื”ื›ืœื™ื ืฉืœ ืคืœื˜ืคื•ืจืžืช ื”ื ืชื•ื ื™ื. ื–ื” ื”ื™ื” ืงืฉื” ืžืกื™ื‘ื•ืช ืจื‘ื•ืช. ื”ื™ื™ื ื• ืฆืจื™ื›ื™ื ืœืคืชื— ืชืฉืชื™ืช ื›ื“ื™ ืœื”ื˜ืžื™ืข ื‘ืื•ืคืŸ ืžื”ื™ืžืŸ ื›ืžื•ื™ื•ืช ื’ื“ื•ืœื•ืช ืฉืœ ื ืชื•ื ื™ื, ืœืชืžื•ืš ื‘ื ื™ื”ื•ืœ ื ืชื•ื ื™ื ื‘ืจื—ื‘ื™ ื”ื—ื‘ืจื”, ืœื”ื‘ื˜ื™ื— ื‘ืงืจื•ืช ื’ื™ืฉื” ื ืื•ืชื•ืช ื•ืœื”ื‘ื˜ื™ื— ืืช ืคืจื˜ื™ื•ืช ื”ืœืงื•ื—ื•ืช. ื”ื™ื™ื ื• ืฆืจื™ื›ื™ื ื’ื ืœื™ืฆื•ืจ ืžืขืจื›ื•ืช ืœื”ืงืฆืืช ืžืฉืื‘ื™ื, ื ื™ื˜ื•ืจ ื•ื”ื—ื–ืจื™ื ื›ืกืคื™ื™ื ื›ื“ื™ ืฉืฆื•ื•ืชื™ื ื™ื•ื›ืœื• ืœื”ืฉืชืžืฉ ื‘-BigQuery ื‘ื™ืขื™ืœื•ืช.

ื‘ื ื•ื‘ืžื‘ืจ 2018, ืฉื—ืจืจื ื• ืžื”ื“ื•ืจืช ืืœืคื ื›ืœืœ ื”ื—ื‘ืจื” ืฉืœ BigQuery ื•-Data Studio. ื”ืฆืขื ื• ืœืขื•ื‘ื“ื™ ื˜ื•ื•ื™ื˜ืจ ื›ืžื” ืžื”ื’ืœื™ื•ื ื•ืช ื”ืืœืงื˜ืจื•ื ื™ื™ื ื”ื ืคื•ืฆื™ื ื‘ื™ื•ืชืจ ืฉืœื ื• ืขื ื ืชื•ื ื™ื ืื™ืฉื™ื™ื ื ืงื™ื™ื. ื‘-BigQuery ื ืขืฉื” ืฉื™ืžื•ืฉ ืขืœ ื™ื“ื™ ืœืžืขืœื” ืž-250 ืžืฉืชืžืฉื™ื ืžืžื’ื•ื•ืŸ ืฆื•ื•ืชื™ื ื›ื•ืœืœ ื”ื ื“ืกื”, ืคื™ื ื ืกื™ื ื•ืฉื™ื•ื•ืง. ืœืื—ืจื•ื ื”, ื”ื ื”ืจื™ืฆื• ื›-8 ื‘ืงืฉื•ืช, ืขื™ื‘ื•ื“ ืฉืœ ื›-100 PB ื‘ื—ื•ื“ืฉ, ืœื ืกื•ืคืจ ื‘ืงืฉื•ืช ืžืชื•ื–ืžื ื•ืช. ืœืื—ืจ ืฉืงื™ื‘ืœื ื• ืžืฉื•ื‘ ื—ื™ื•ื‘ื™ ืžืื•ื“, ื”ื—ืœื˜ื ื• ืœื”ืชืงื“ื ื•ืœื”ืฆื™ืข ืืช BigQuery ื›ืžืฉืื‘ ื”ืขื™ืงืจื™ ืœืื™ื ื˜ืจืืงืฆื™ื” ืขื ื ืชื•ื ื™ื ื‘ื˜ื•ื•ื™ื˜ืจ.

ื”ื ื” ืชืจืฉื™ื ื‘ืจืžื” ื’ื‘ื•ื”ื” ืฉืœ ืืจื›ื™ื˜ืงื˜ื•ืจืช ืžื—ืกื ื™ ื”ื ืชื•ื ื™ื ืฉืœ Google BigQuery ืฉืœื ื•.

ื›ื™ืฆื“ BigQuery ืฉืœ Google ื‘ื™ืฆืขื” ื“ืžื•ืงืจื˜ื™ื” ืฉืœ ื ื™ืชื•ื— ื ืชื•ื ื™ื. ื—ืœืง 1
ืื ื• ืžืขืชื™ืงื™ื ื ืชื•ื ื™ื ืžืืฉื›ื•ืœื•ืช Hadoop ืžืงื•ืžื™ื™ื ืœ-Google Cloud Storage (GCS) ื‘ืืžืฆืขื•ืช ื”ื›ืœื™ ื”ืคื ื™ืžื™ Cloud Replicator. ืœืื—ืจ ืžื›ืŸ ืื ื• ืžืฉืชืžืฉื™ื ื‘-Apache Airflow ื›ื“ื™ ืœื™ืฆื•ืจ ืฆื™ื ื•ืจื•ืช ื”ืžืฉืชืžืฉื™ื ื‘-"bq_loadยป ื›ื“ื™ ืœื˜ืขื•ืŸ ื ืชื•ื ื™ื ืž-GCS ืœืชื•ืš BigQuery. ืื ื• ืžืฉืชืžืฉื™ื ื‘-Presto ื›ื“ื™ ืœื‘ืฆืข ืฉืื™ืœืชื•ืช ืขืœ ืžืขืจื›ื™ ื ืชื•ื ื™ื ืฉืœ Parquet ืื• Thrift-LZO ื‘-GCS. BQ Blaster ื”ื•ื ื›ืœื™ Scalding ืคื ื™ืžื™ ืœื˜ืขื™ื ืช ืžืขืจื›ื™ ื ืชื•ื ื™ื ืฉืœ HDFS Vertica ื•-Thrift-LZO ืœืชื•ืš BigQuery.

ื‘ืกืขื™ืคื™ื ื”ื‘ืื™ื, ืื ื• ื“ื ื™ื ื‘ื’ื™ืฉื” ื•ื‘ืžื•ืžื—ื™ื•ืช ืฉืœื ื• ื‘ืชื—ื•ืžื™ื ืฉืœ ืงืœื•ืช ืฉื™ืžื•ืฉ, ื‘ื™ืฆื•ืขื™ื, ื ื™ื”ื•ืœ ื ืชื•ื ื™ื, ื‘ืจื™ืื•ืช ื”ืžืขืจื›ืช ื•ืขืœื•ืช.

ืงืœื•ืช ืฉื™ืžื•ืฉ

ื’ื™ืœื™ื ื• ืฉืงืœ ืœืžืฉืชืžืฉื™ื ืœื”ืชื—ื™ืœ ืขื BigQuery ืžื›ื™ื•ื•ืŸ ืฉื”ื•ื ืœื ื“ื•ืจืฉ ื”ืชืงื ืช ืชื•ื›ื ื” ื•ืžืฉืชืžืฉื™ื ื™ื›ืœื• ืœื’ืฉืช ืืœื™ื• ื“ืจืš ืžืžืฉืง ืื™ื ื˜ืจื ื˜ ืื™ื ื˜ื•ืื™ื˜ื™ื‘ื™. ืขื ื–ืืช, ื”ืžืฉืชืžืฉื™ื ื”ื™ื• ืฆืจื™ื›ื™ื ืœื”ื›ื™ืจ ื›ืžื” ืžื”ืชื›ื•ื ื•ืช ื•ื”ืžื•ืฉื’ื™ื ืฉืœ GCP, ื›ื•ืœืœ ืžืฉืื‘ื™ื ื›ื’ื•ืŸ ืคืจื•ื™ืงื˜ื™ื, ืžืขืจื›ื™ ื ืชื•ื ื™ื ื•ื˜ื‘ืœืื•ืช. ืคื™ืชื—ื ื• ื—ื•ืžืจื™ื ื•ืžื“ืจื™ื›ื™ ืœื™ืžื•ื“ ื›ื“ื™ ืœืขื–ื•ืจ ืœืžืฉืชืžืฉื™ื ืœื”ืชื—ื™ืœ. ืขื ื”ื‘ื ื” ื‘ืกื™ืกื™ืช ืฉื”ื•ืฉื’ื”, ืœืžืฉืชืžืฉื™ื ื”ื™ื” ืงืœ ืœื ื•ื•ื˜ ื‘ืขืจื›ื•ืช ื ืชื•ื ื™ื, ืœื”ืฆื™ื’ ื ืชื•ื ื™ ืกื›ื™ืžื” ื•ื˜ื‘ืœื”, ืœื”ืจื™ืฅ ืฉืื™ืœืชื•ืช ืคืฉื•ื˜ื•ืช ื•ืœื”ืžื—ื™ืฉ ืชื•ืฆืื•ืช ื‘-Data Studio.

ื”ืžื˜ืจื” ืฉืœื ื• ืœื”ื–ื ืช ื ืชื•ื ื™ื ืœ-BigQuery ื”ื™ื™ืชื” ืœืืคืฉืจ ื˜ืขื™ื ื” ื—ืœืงื” ืฉืœ ืžืขืจื›ื™ ื ืชื•ื ื™ื HDFS ืื• GCS ื‘ืœื—ื™ืฆื” ืื—ืช. ืฉืงืœื ื• ืžืœื—ื™ืŸ ืขื ืŸ (ืžื ื•ื”ืœ ืขืœ ื™ื“ื™ Airflow) ืืš ืœื ื”ืฆืœื™ื—ื• ืœื”ืฉืชืžืฉ ื‘ื• ื‘ื’ืœืœ ืžื•ื“ืœ ื”ืื‘ื˜ื—ื” ืฉืœื ื• ืœืฉื™ืชื•ืฃ ืžื•ื’ื‘ืœ ื‘ื“ื•ืžื™ื™ืŸ (ืขื•ื“ ืขืœ ื›ืš ื‘ืกืขื™ืฃ ื ื™ื”ื•ืœ ื ืชื•ื ื™ื ืœืžื˜ื”). ื ื™ืกื™ื ื• ืœื”ืฉืชืžืฉ ื‘ืฉื™ืจื•ืช ื”ืขื‘ืจืช ื”ื ืชื•ื ื™ื ืฉืœ Google (DTS) ื›ื“ื™ ืœืชื–ืžืŸ ืขื•ืžืกื™ ืขื‘ื•ื“ื” ืฉืœ BigQuery. ื‘ืขื•ื“ ืฉ-DTS ืžื™ื”ืจ ืœื”ื’ื“ื™ืจ, ื”ื•ื ืœื ื”ื™ื” ื’ืžื™ืฉ ืœื‘ื ื™ื™ืช ืฆื™ื ื•ืจื•ืช ืขื ืชืœื•ืช. ืขื‘ื•ืจ ืžื”ื“ื•ืจืช ื”ืืœืคื ืฉืœื ื•, ื‘ื ื™ื ื• ืžืกื’ืจืช ืžืฉืœื ื• ืฉืœ Apache Airflow ื‘-GCE ื•ืžื›ื™ื ื™ื ืื•ืชื” ืœืคืขื•ืœ ื‘ื™ื™ืฆื•ืจ ื•ืœื”ื™ื•ืช ืžืกื•ื’ืœื™ื ืœืชืžื•ืš ื‘ืžืงื•ืจื•ืช ื ืชื•ื ื™ื ื ื•ืกืคื™ื ื›ืžื• Vertica.

ื›ื“ื™ ืœื”ืคื•ืš ื ืชื•ื ื™ื ืœ-BigQuery, ืžืฉืชืžืฉื™ื ื™ื•ืฆืจื™ื ืฆื™ื ื•ืจื•ืช ื ืชื•ื ื™ื ืคืฉื•ื˜ื™ื ืฉืœ SQL ื‘ืืžืฆืขื•ืช ืฉืื™ืœืชื•ืช ืžืชื•ื–ืžื ื•ืช. ืขื‘ื•ืจ ืฆื™ื ื•ืจื•ืช ืžื•ืจื›ื‘ื™ื ืžืจื•ื‘ื™-ืฉืœื‘ื™ื ืขื ืชืœื•ืช, ืื ื• ืžืชื›ื ื ื™ื ืœื”ืฉืชืžืฉ ื‘ืžืกื’ืจืช Airflow ืžืฉืœื ื• ืื• ื‘-Cloud Composer ื™ื—ื“ ืขื ื–ืจื ื ืชื•ื ื™ื ื‘ืขื ืŸ.

ืคืจื•ื“ื•ืงื˜ื™ื‘ื™ื•ืช

BigQuery ืžื™ื•ืขื“ ืœืฉืื™ืœืชื•ืช SQL ืœืžื˜ืจื•ืช ื›ืœืœื™ื•ืช ื”ืžืขื‘ื“ื•ืช ื›ืžื•ื™ื•ืช ื’ื“ื•ืœื•ืช ืฉืœ ื ืชื•ื ื™ื. ื”ื•ื ืื™ื ื• ืžื™ื•ืขื“ ืœืฉืื™ืœืชื•ืช ื”ืื—ื–ื•ืจ ื”ื ืžื•ืš, ื”ืชืคื•ืงื” ื”ื’ื‘ื•ื”ื” ื”ื ื“ืจืฉืช ืขืœ-ื™ื“ื™ ืžืกื“ ื ืชื•ื ื™ื ืขืกืงื”, ืื• ืœื ื™ืชื•ื— ืกื“ืจื•ืช ื–ืžืŸ ืขื ื–ืžืŸ ืื—ื–ื•ืจ ื ืžื•ืš. ืืคืืฆ'ื™ ื“ืจื•ืื™ื“. ืขื‘ื•ืจ ืฉืื™ืœืชื•ืช ื ื™ืชื•ื— ืื™ื ื˜ืจืืงื˜ื™ื‘ื™ื•ืช, ื”ืžืฉืชืžืฉื™ื ืฉืœื ื• ืžืฆืคื™ื ืœื–ืžื ื™ ืชื’ื•ื‘ื” ืฉืœ ืคื—ื•ืช ืžื“ืงื” ืื—ืช. ื”ื™ื™ื ื• ืฆืจื™ื›ื™ื ืœืขืฆื‘ ืืช ื”ืฉื™ืžื•ืฉ ืฉืœื ื• ื‘-BigQuery ื›ื“ื™ ืœืขืžื•ื“ ื‘ืฆื™ืคื™ื•ืช ื”ืœืœื•. ื›ื“ื™ ืœืกืคืง ื‘ื™ืฆื•ืขื™ื ืฆืคื•ื™ื™ื ืœืžืฉืชืžืฉื™ื ืฉืœื ื•, ืžื™ื ืคื ื• ืืช ื”ืคื•ื ืงืฆื™ื•ื ืœื™ื•ืช ืฉืœ BigQuery, ื”ื–ืžื™ื ื” ืœืœืงื•ื—ื•ืช ืขืœ ื‘ืกื™ืก ืขืžืœื” ืงื‘ื•ืขื”, ื”ืžืืคืฉืจืช ืœื‘ืขืœื™ ืคืจื•ื™ืงื˜ื™ื ืœืฉืžื•ืจ ืžืฉื‘ืฆื•ืช ืžื™ื ื™ืžื•ื ืขื‘ื•ืจ ื”ืฉืื™ืœืชื•ืช ืฉืœื”ื. ะกะปะพั‚ BigQuery ื”ื™ื ื™ื—ื™ื“ืช ื›ื•ื— ืžื—ืฉื•ื‘ ื”ื ื“ืจืฉืช ืœื‘ื™ืฆื•ืข ืฉืื™ืœืชื•ืช SQL.

ื ื™ืชื—ื ื• ืœืžืขืœื” ืž-800 ืฉืื™ืœืชื•ืช ืฉืขื‘ื“ื• ื›-1 TB ืฉืœ ื ืชื•ื ื™ื ื›ืœ ืื—ืช ื•ืžืฆืื ื• ืฉื–ืžืŸ ื”ื‘ื™ืฆื•ืข ื”ืžืžื•ืฆืข ื”ื™ื” 30 ืฉื ื™ื•ืช. ืœืžื“ื ื• ื’ื ืฉื‘ื™ืฆื•ืขื™ื ืชืœื•ื™ื™ื ืžืื•ื“ ื‘ืฉื™ืžื•ืฉ ื‘ืžืฉื‘ืฆืช ืฉืœื ื• ื‘ืคืจื•ื™ืงื˜ื™ื ื•ืžืฉื™ืžื•ืช ืฉื•ื ื•ืช. ื”ื™ื™ื ื• ืฆืจื™ื›ื™ื ืœื”ื’ื“ื™ืจ ื‘ื‘ื™ืจื•ืจ ืืช ืขืชื•ื“ื•ืช ื”ื™ื™ืฆื•ืจ ื•ื—ืจื™ืฆื™ ื”ืื“-ื”ื•ืง ืฉืœื ื• ื›ื“ื™ ืœืฉืžื•ืจ ืขืœ ื‘ื™ืฆื•ืขื™ื ืขื‘ื•ืจ ืžืงืจื™ ืฉื™ืžื•ืฉ ื‘ื™ื™ืฆื•ืจ ื•ื ื™ืชื•ื— ืžืงื•ื•ืŸ. ื–ื” ื”ืฉืคื™ืข ืจื‘ื•ืช ืขืœ ื”ืขื™ืฆื•ื‘ ืฉืœื ื• ืœืฉืžื™ืจืช ืžืฉื‘ืฆื•ืช ื•ื”ื™ืจืจื›ื™ื™ืช ื”ืคืจื•ื™ืงื˜.

ื ื“ื‘ืจ ืขืœ ื ื™ื”ื•ืœ ื ืชื•ื ื™ื, ืคื•ื ืงืฆื™ื•ื ืœื™ื•ืช ื•ืขืœื•ืช ืžืขืจื›ื•ืช ื‘ื™ืžื™ื ื”ืงืจื•ื‘ื™ื ื‘ื—ืœืง ื”ืฉื ื™ ืฉืœ ื”ืชืจื’ื•ื, ืืš ื›ืขืช ืื ื• ืžื–ืžื™ื ื™ื ืืช ื›ื•ืœื ืกืžื™ื ืจ ืžืงื•ื•ืŸ ื—ื™ ื‘ื—ื™ื ื, ื‘ืžื”ืœื›ื• ืชื•ื›ืœื• ืœืœืžื•ื“ ื‘ื”ืจื—ื‘ื” ืขืœ ื”ืงื•ืจืก, ื•ื›ืŸ ืœืฉืื•ืœ ืฉืืœื•ืช ืœืžื•ืžื—ื” ืฉืœื ื• - ืื’ื•ืจ ืžืื˜ืฉื•ืง (ืžื”ื ื“ืก ื ืชื•ื ื™ื ื‘ื›ื™ืจ, MaximaTelecom).

ืงืจื ืขื•ื“:

ืžืงื•ืจ: www.habr.com

ื”ื•ืกืคืช ืชื’ื•ื‘ื”