Habrastatistics: Habr anorarama sei pasina geektimes

Hei Habr.

Ichi chinyorwa ndechekuenderera mberi kwechiyero Izvo zvakanakisa Habr zvinyorwa zve2018. Uye kunyange zvazvo gore risati rapera, sezvamunoziva, muzhizha pane kuchinja kwemitemo, maererano naizvozvo, zvakava zvinonakidza kuona kana izvi zvakakanganisa chero chinhu.

Habrastatistics: Habr anorarama sei pasina geektimes

Pamusoro pehuwandu chaihwo, chiyero chakagadziridzwa chezvinyorwa chichapihwa, pamwe neimwe kodhi kodhi yeavo vanofarira maitirwo ayo.

Kune avo vanofarira zvakaitika, kuenderera mberi kuri pasi pekucheka. Avo vanofarira kuongorora kwakadzama kwezvikamu zvesaiti vanogonawo kutarisa chikamu chinotevera.

Data yekutanga

Chiyero ichi hachina pamutemo, uye ini handina ruzivo rwemukati. Sezvauri kuona zviri nyore nekutarisa kero kero yebrowser yako, zvese zvinyorwa paHabrΓ© zvine nhamba inoenderera. Zvadaro inyaya yeunyanzvi, tinongoverenga zvinyorwa zvose mumutsara mumucheka (mune thread imwe uye nekumira, kuitira kuti tisatakura sevha). Iwo maitiro ega akawanikwa neakapusa parser muPython (zvinyorwa zviripo pano) uye yakachengetwa mu csv faira chimwe chinhu chakadai:

2019-08-11T22:36Z,https://habr.com/ru/post/463197/,"Blazor + MVVM = Silverlight наносит ΠΎΡ‚Π²Π΅Ρ‚Π½Ρ‹ΠΉ ΡƒΠ΄Π°Ρ€, ΠΏΠΎΡ‚ΠΎΠΌΡƒ Ρ‡Ρ‚ΠΎ Π΄Ρ€Π΅Π²Π½Π΅Π΅ Π·Π»ΠΎ Π½Π΅ΠΏΠΎΠ±Π΅Π΄ΠΈΠΌΠΎ",votes:11,votesplus:17,votesmin:6,bookmarks:40,views:5300,comments:73
2019-08-11T05:26Z,https://habr.com/ru/news/t/463199/,"Π’ NASA испытали систСму Π°Π²Ρ‚ΠΎΠ½ΠΎΠΌΠ½ΠΎΠ³ΠΎ управлСния ΠΎΠ΄Π½ΠΎΠ³ΠΎ микроспутника Π΄Ρ€ΡƒΠ³ΠΈΠΌ",votes:15,votesplus:15,votesmin:0,bookmarks:2,views:1700,comments:7

Processing

Pakuparadzanisa isu tichashandisa Python, Pandas uye Matplotlib. Avo vasingafarire manhamba vanogona kusvetuka chikamu ichi vonanga kuzvinyorwa.

Kutanga iwe unofanirwa kurodha dataset mundangariro uye sarudza data yegore raunoda.

import pandas as pd
import datetime
import matplotlib.dates as mdates
from matplotlib.ticker import FormatStrFormatter
from pandas.plotting import register_matplotlib_converters


df = pd.read_csv("habr.csv", sep=',', encoding='utf-8', error_bad_lines=True, quotechar='"', comment='#')
dates = pd.to_datetime(df['datetime'], format='%Y-%m-%dT%H:%MZ')
df['datetime'] = dates
year = 2019
df = df[(df['datetime'] >= pd.Timestamp(datetime.date(year, 1, 1))) & (df['datetime'] < pd.Timestamp(datetime.date(year+1, 1, 1)))]

print(df.shape)

Zvinoitika kuti gore rino (kunyangwe risati rapera) panguva yekunyora, zvinyorwa 12715 zvakaburitswa. Kuenzanisa, kwegore rose ra2018 - 15904. Kazhinji, zvakawanda - izvi zvinenge zvinyorwa zve 43 pazuva (uye izvi zvinongova nechiyero chakanaka; zvingani zvinyorwa zvese zvakatorwa zvakaenda zvisina kunaka kana kuti zvakadzimwa, munhu anogona chete. fembera kana kufungidzira kubva pamikaha pakati pezviziviso).

Ngatisarudzei minda inodiwa kubva kune dataset. Semetrics isu tichashandisa nhamba yemaonero, makomendi, rating values ​​uye nhamba yemabhukimaki.

def to_float(s):
    # "bookmarks:22" => 22.0
    num = ''.join(i for i in s if i.isdigit())
    return float(num)

def to_int(s):
    # "bookmarks:22" => 22
    num = ''.join(i for i in s if i.isdigit())
    return int(num)

def to_date(dt):
    return dt.date() 

date = dates.map(to_date, na_action=None)
views = df["views"].map(to_int, na_action=None)
bookmarks = df["bookmarks"].map(to_int, na_action=None)
votes = df["votes"].map(to_float, na_action=None)
votes_up = df["up"].map(to_float, na_action=None)
votes_down = df["down"].map(to_float, na_action=None)
comments = df["comments"].map(to_int, na_action=None)

df['date'] = date
df['views'] = views
df['votes'] = votes
df['bookmarks'] = bookmarks
df['up'] = votes_up
df['down'] = votes_down

Ikozvino data yawedzerwa kune dataset uye isu tinogona kuishandisa. Ngatiunganidzei data nezuva uye titore avhareji kukosha.

g = df.groupby(['date'])
days_count = g.size().reset_index(name='counts')
year_days = days_count['date'].values
grouped = g.median().reset_index()
grouped['counts'] = days_count['counts']
counts_per_day = grouped['counts'].values
counts_per_day_avg = grouped['counts'].rolling(window=20).mean()
view_per_day = grouped['views'].values
view_per_day_avg = grouped['views'].rolling(window=20).mean()
votes_per_day = grouped['votes'].values
votes_per_day_avg = grouped['votes'].rolling(window=20).mean()
bookmarks_per_day = grouped['bookmarks'].values
bookmarks_per_day_avg = grouped['bookmarks'].rolling(window=20).mean()

Iye zvino chikamu chinonakidza ndechekuti tinogona kutarisa magirafu.

Ngatitarisei huwandu hwezvinyorwa paHabrΓ© muna 2019.

import matplotlib.pyplot as plt

plt.rcParams["figure.figsize"] = (16, 8)
fig, ax = plt.subplots()

plt.bar(year_days, counts_per_day, label='Articles/day')
plt.plot(year_days, counts_per_day_avg, 'g-', label='Articles avg/day')
plt.xticks(rotation=45)
ax.xaxis.set_major_formatter(mdates.DateFormatter("%d-%m-%Y"))  
ax.xaxis.set_major_locator(mdates.MonthLocator(interval=1))
plt.legend(loc='best')
plt.tight_layout()
plt.show()

Mhedzisiro yacho inofadza. Sezvauri kuona, Habr anga ari "soseji" diki gore rese. Handizivi chikonzero.

Habrastatistics: Habr anorarama sei pasina geektimes

Kuenzanisa, 2018 inotarisa zvishoma zvishoma:

Habrastatistics: Habr anorarama sei pasina geektimes

Kazhinji, ini handina kuona kuderera kwakanyanya kwehuwandu hwezvinyorwa zvakaburitswa muna 2019 pagirafu. Uyezve, pane zvinopesana, inoratidzika kunge yakatowedzera zvishoma kubva muzhizha.

Asi magirafu maviri anotevera anondishungurudza zvishoma.

Avhareji yenhamba yemaonero pachinyorwa chimwe nechimwe:

Habrastatistics: Habr anorarama sei pasina geektimes

Avhareji yekuyera pachinyorwa chimwe nechimwe:

Habrastatistics: Habr anorarama sei pasina geektimes

Sezvauri kuona, avhareji yenhamba yemaonero inodzikira zvishoma gore rose. Izvi zvinogona kutsanangurwa nenyaya yekuti zvinyorwa zvitsva hazvisati zvanyorwa neinjini dzekutsvaga, uye hazviwanikwe kakawanda. Asi kuderera kweavhareji yekuyera pachinyorwa kunonyanya kusanzwisisika. Manzwiro acho ndeekuti vaverengi vangangove vasina nguva yekutarisa mukati mezvinyorwa zvakawanda kana kusateerera zviyero. Kubva pakuona kwechirongwa chemubairo wemunyori, maitiro aya haafadzi.

Nenzira, izvi hazvina kuitika muna 2018, uye purogiramu yakawedzera kana zvishoma.

Habrastatistics: Habr anorarama sei pasina geektimes

Kazhinji, varidzi vezviwanikwa vane chimwe chinhu chekufunga nezvazvo.

Asi ngatirege kutaura pamusoro pezvinhu zvinosuruvarisa. Kazhinji, tinogona kutaura kuti Habr "akapona" zhizha rinoshanduka zvakabudirira, uye nhamba yezvinyorwa pane saiti haina kuderera.

Rating

Zvino, chaizvoizvo, rating. Makorokoto kune vakapinda mairi. Rega ndikuyeuchidze zvakare kuti chiyero hachina pamutemo, pamwe ndakapotsa chimwe chinhu, uye kana chimwe chinyorwa chinofanira kunge chiri pano, asi hachisi, nyora, ndichachiwedzera nemaoko. Sechiyero, ini ndinoshandisa akaverengerwa metrics, yandinofunga kuti yakazove inonakidza.

Zvinyorwa zvepamusoro nehuwandu hwemaonero

Zvinyorwa zvepamusoro maererano nechiyero chemaonero

Zvinyorwa zvepamusoro nemaonero kune reshiyo yekuona

Nyaya dzepamusoro dzinonetsa

Zvinyorwa zvepamusoro nekuyera

Zvinyorwa zvepamusoro nehuwandu hwemabhukimaki

Pamusoro nereshiyo yemabhukimaki kune maonero

Zvinyorwa zvepamusoro nehuwandu hwemashoko

Uye pakupedzisira, wekupedzisira Antitop nehuwandu hwekusada

Ugh. Ndine mamwe mashoma anonakidza sarudzo, asi ini handisi kuzonetsa vaverengi.

mhedziso

Pakugadzira chiyero, ndakateerera pfungwa mbiri dzaiita sedzinonakidza.

Chekutanga, 60% yepamusoro izvinyorwa zve "geektimes" genre. Kana pachava nevashoma vavo gore rinouya, uye kuti Habr ichaita sei pasina zvinyorwa pamusoro pedoro, nzvimbo, mushonga, nezvimwewo, handizivi. Zvechokwadi, vaverengi vacharasikirwa nechimwe chinhu. Ngationei.

Chechipiri, mabhukimaki epamusoro akave emhando yepamusoro isingatarisirwe. Izvi zvinonzwisisika mupfungwa; vaverengi vanogona kusateerera kuyera, asi kana chinyorwa zvinodiwa, zvino ichawedzerwa kumabhukimaki ako. Uye heino chaizvo iyo yakanyanya kuunganidzwa yezvinyorwa zvinobatsira uye zvakakomba. Ndinofunga kuti varidzi vesaiti vanofanirwa kufunga nezvekubatana pakati pehuwandu hwemabhukimaki uye chirongwa chemubairo kana vachida kuwedzera chikamu ichi chezvinyorwa pano paHabrΓ©.

Chinhu chakadai. Ndinovimba yanga ine ruzivo.

Rondedzero yezvinyorwa yakave yakareba, zvakanaka, ingangove iri nani. Kufara kuverenga munhu wese.

Source: www.habr.com

Voeg