Cristian Petrescu-Prahova cristipp

cristipp/allennlp 0

An open-source NLP research library, built on PyTorch.

cristipp/coref_annotator 0

Web based application/GUI to annotate coreference resolution data over plaintext

cristipp/coronadatascraper 0

COVID-19 Coronavirus data scraped from government and curated data sources.

cristipp/decaNLP 0

The Natural Language Decathlon: A Multitask Challenge for NLP

cristipp/escape 0

Sivia's first escape game

cristipp/hmtl 0

🌊HMTL: Hierarchical Multi-Task Learning - A State-of-the-Art neural network model for several NLP tasks based on PyTorch and AllenNLP

cristipp/spider-schema-gnn 0

Author implementation of the paper "Representing Schema Structure with Graph Neural Networks for Text-to-SQL Parsing"

cristipp/spider-schema-gnn-global 0

Author implementation of Global Reasoning over Database Structures for Text-to-SQL Parsing

started tc39/proposal-record-tuple

started time in a month

started cwida/duckdb

started time in a month

started nteract/semiotic

started time in 2 months

started amitness/toolbox

started time in 2 months

started ocornut/imgui

started time in 2 months

issue comment covidatlas/coronadatascraper

Number of US states are missing deaths/tested

Good to have the state level data fixed. We're still lacking county level data for NY fatalities:

coronadatascraper: [
  {key: 'US-NY', date: '2020-04-22', cases: 257216, tested: 669982, deaths: 15302},
  {key: 'US-NY-Queens County', date: '2020-04-22', cases: 43713, tested: 88388, deaths: undefined},
  {key: 'US-NY-Kings County', date: '2020-04-22', cases: 38481, tested: 81787, deaths: undefined},
  {key: 'US-NY-Nassau County', date: '2020-04-22', cases: 31555, tested: 74571, deaths: undefined},
  {key: 'US-NY-Bronx County', date: '2020-04-22', cases: 30868, tested: 65304, deaths: undefined},
  {key: 'US-NY-Suffolk County', date: '2020-04-22', cases: 28854, tested: 71268, deaths: undefined},
  {key: 'US-NY-Westchester County', date: '2020-04-22', cases: 25276, tested: 76564, deaths: undefined},
  {key: 'US-NY-New York County', date: '2020-04-22', cases: 19025, tested: 49687, deaths: undefined},
  {key: 'US-NY-Richmond County', date: '2020-04-22', cases: 10345, tested: 26289, deaths: undefined},
  {key: 'US-NY-Rockland County', date: '2020-04-22', cases: 9699, tested: 23150, deaths: undefined},
  ... 53 more
]

The county level fatalities can be pulled from NYT or USAFacts, with the quirk that NYT sums the 5 counties of NYC together. Also note these sources don't report 'tested', which CoronaDataScraper does:

nyt: [
  {key: 'US-NY', date: '2020-04-22', cases: 257246, deaths: 15302},
  {key: 'US-NY-New York City', date: '2020-04-22', cases: 142442, deaths: 10614},
  {key: 'US-NY-Nassau', date: '2020-04-22', cases: 31555, deaths: 1764},
  {key: 'US-NY-Suffolk', date: '2020-04-22', cases: 28854, deaths: 959},
  {key: 'US-NY-Westchester', date: '2020-04-22', cases: 25275, deaths: 932},
  {key: 'US-NY-Rockland', date: '2020-04-22', cases: 9699, deaths: 309},
  {key: 'US-NY-Orange', date: '2020-04-22', cases: 6705, deaths: 183},
  {key: 'US-NY-Dutchess', date: '2020-04-22', cases: 2391, deaths: 57},
  {key: 'US-NY-Erie', date: '2020-04-22', cases: 2233, deaths: 174},
  {key: 'US-NY-Monroe', date: '2020-04-22', cases: 1112, deaths: 72},
  ... 50 more
]

usafacts: [
  {key: 'US-NY-Queens County', date: '2020-04-22', cases: 43713, deaths: 3432},
  {key: 'US-NY-Kings County', date: '2020-04-22', cases: 38481, deaths: 3458},
  {key: 'US-NY-Nassau County', date: '2020-04-22', cases: 31555, deaths: 1431},
  {key: 'US-NY-Bronx County', date: '2020-04-22', cases: 31130, deaths: 2258},
  {key: 'US-NY-Suffolk County', date: '2020-04-22', cases: 28854, deaths: 926},
  {key: 'US-NY-Westchester County', date: '2020-04-22', cases: 25276, deaths: 838},
  {key: 'US-NY-New York County', date: '2020-04-22', cases: 19025, deaths: 1337},
  {key: 'US-NY-Richmond County', date: '2020-04-22', cases: 10405, deaths: 492},
  {key: 'US-NY-Rockland County', date: '2020-04-22', cases: 9699, deaths: 334},
  {key: 'US-NY-Orange County', date: '2020-04-22', cases: 6690, deaths: 185},
  ... 54 more
]
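To make the NYC quirk concrete, here is a minimal sketch that sums the five borough counties from the usafacts rows quoted above into a single 'US-NY-New York City' row, so it can be compared with the NYT row. The names `NYC_COUNTY_KEYS` and `sumNycCounties` are made up for this illustration and are not part of CoronaDataScraper.

```javascript
// Rows copied from the usafacts listing above (2020-04-22).
const usafactsSample = [
  {key: 'US-NY-Queens County', date: '2020-04-22', cases: 43713, deaths: 3432},
  {key: 'US-NY-Kings County', date: '2020-04-22', cases: 38481, deaths: 3458},
  {key: 'US-NY-Nassau County', date: '2020-04-22', cases: 31555, deaths: 1431},
  {key: 'US-NY-Bronx County', date: '2020-04-22', cases: 31130, deaths: 2258},
  {key: 'US-NY-New York County', date: '2020-04-22', cases: 19025, deaths: 1337},
  {key: 'US-NY-Richmond County', date: '2020-04-22', cases: 10405, deaths: 492},
];

// The five counties that NYT reports together as "New York City".
const NYC_COUNTY_KEYS = new Set([
  'US-NY-Kings County',
  'US-NY-Queens County',
  'US-NY-New York County',
  'US-NY-Bronx County',
  'US-NY-Richmond County',
]);

// Hypothetical helper: collapse the borough counties into one NYC-level row.
function sumNycCounties(rows) {
  const total = {key: 'US-NY-New York City', cases: 0, deaths: 0};
  for (const row of rows) {
    if (!NYC_COUNTY_KEYS.has(row.key)) continue; // skip non-NYC counties such as Nassau
    total.cases += row.cases;
    total.deaths += row.deaths;
  }
  return total;
}

console.log(sumNycCounties(usafactsSample));
// → { key: 'US-NY-New York City', cases: 142754, deaths: 10977 }
```

The usafacts sum (142754 cases, 10977 deaths) does not match the NYT New York City row (142442 cases, 10614 deaths), so the two sources disagree even after aggregation. Going the other way, from the NYT total to per-county numbers, is not possible without extra information.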

  • We have #876, which adds county-level fatalities data from NYT. Alas, that PR does not fill in data for NYC counties [Kings, Queens, New York, Bronx, Richmond, see https://en.wikipedia.org/wiki/Boroughs_of_New_York_City] because of the aforementioned quirk, which makes it less useful than I'd like.

  • There is a general question on whether CoronaDataScraper wants to fill in missing data from other meta-aggregators as a fallback, in a post-processing step, on a cell-by-cell basis. Think of an extra field 'fallback: true', which enables the fallback behavior on a source-by-source basis, in priority order. Then we could add nyt, usafacts and whatnot. Would that be something of interest?
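A minimal sketch of what such a cell-by-cell fallback could look like, assuming rows shaped like the listings above and sources passed in priority order. The function names and the `{name, fallback, rows}` source shape are hypothetical; nothing here reflects existing CoronaDataScraper code.

```javascript
// Index rows by "key|date" so each lookup is a single Map access.
function indexByKeyAndDate(rows) {
  const index = new Map();
  for (const row of rows) index.set(`${row.key}|${row.date}`, row);
  return index;
}

// Fill undefined/null cells of `primaryRows` from the first fallback source
// (in array order) that has a value for the same key, date and field.
function fillFromFallbacks(primaryRows, sources, fields) {
  const indexes = sources
    .filter(source => source.fallback === true) // opt-in per source
    .map(source => indexByKeyAndDate(source.rows));
  return primaryRows.map(row => {
    const filled = {...row};
    for (const field of fields) {
      if (filled[field] !== undefined && filled[field] !== null) continue;
      for (const index of indexes) {
        const candidate = index.get(`${row.key}|${row.date}`);
        if (candidate && candidate[field] !== undefined && candidate[field] !== null) {
          filled[field] = candidate[field];
          break; // the highest-priority source with a value wins
        }
      }
    }
    return filled;
  });
}

// Sample rows taken from the listings above.
const primarySample = [
  {key: 'US-NY-Nassau County', date: '2020-04-22', cases: 31555, tested: 74571, deaths: undefined},
  {key: 'US-NY-Rockland County', date: '2020-04-22', cases: 9699, tested: 23150, deaths: undefined},
];
const fallbackSources = [
  {name: 'nyt', fallback: true, rows: [
    {key: 'US-NY-Nassau', date: '2020-04-22', cases: 31555, deaths: 1764},
    {key: 'US-NY-Rockland', date: '2020-04-22', cases: 9699, deaths: 309},
  ]},
  {name: 'usafacts', fallback: true, rows: [
    {key: 'US-NY-Nassau County', date: '2020-04-22', cases: 31555, deaths: 1431},
    {key: 'US-NY-Rockland County', date: '2020-04-22', cases: 9699, deaths: 334},
  ]},
];

const filledSample = fillFromFallbacks(primarySample, fallbackSources, ['deaths']);
console.log(filledSample.map(row => [row.key, row.deaths]));
```

With this sample the deaths come from usafacts even though nyt is listed first, because the NYT keys ('US-NY-Nassau') lack the 'County' suffix and never match. A real implementation would need to normalize keys across sources first, and would probably want to record which source each filled cell came from.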

zbraniecki

comment created time in 2 months

push event cristipp/coronadatascraper

Cristian Petrescu-Prahova

commit sha e8c3af5e8a7e6e41796fbe03df784e53ab70734e

ok

push time in 3 months

pull request comment covidatlas/coronadatascraper

NY scraper using NYT county data. Scrapes cases + deaths, nothing else.

Makes sense. Updated to reuse the nyt-county.js scraper.

cristipp

comment created time in 3 months

push event cristipp/coronadatascraper

Cristian Petrescu-Prahova

commit sha 26ad0648de89b60101f33c0591e2d4d656932e14

ok

Cristian Petrescu-Prahova

commit sha d8c240f36ff63e776642ab939a77c7402ffed652

ok

push time in 3 months

PR opened covidatlas/coronadatascraper

NY scraper using NYT county data, fills in deaths column.

Pulls NY data from the NYT meta-aggregator. See #823. Tradeoff: Only handles cases + deaths, nothing else.

Note: This clones US/nyt-counties.js. Ideally, this should be a function parametrized on state. I'm not sure where in the codebase to put such a function, so let's go with a clone.
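A rough sketch of what a function parametrized on state might look like, assuming the NYT county CSV has already been parsed into objects with `date`, `county`, `state`, `cases` and `deaths` string fields. The function name `nytRowsForState` and its argument shape are hypothetical; the actual US/nyt-counties.js code is not shown here and may be organized differently.

```javascript
// Hypothetical helper: select one state's county rows for a given date and
// convert them to the {key, date, cases, deaths} shape used in the listings.
function nytRowsForState(rows, {stateName, stateCode}, date) {
  return rows
    .filter(row => row.state === stateName && row.date === date)
    .map(row => ({
      key: `US-${stateCode}-${row.county}`,
      date: row.date,
      cases: Number.parseInt(row.cases, 10),   // CSV values arrive as strings
      deaths: Number.parseInt(row.deaths, 10),
    }));
}

// Sample input using the NYT numbers quoted in the issue comment (2020-04-22).
const sampleNytRows = [
  {date: '2020-04-22', county: 'Nassau', state: 'New York', cases: '31555', deaths: '1764'},
  {date: '2020-04-22', county: 'Suffolk', state: 'New York', cases: '28854', deaths: '959'},
];

const nyRows = nytRowsForState(sampleNytRows, {stateName: 'New York', stateCode: 'NY'}, '2020-04-22');
console.log(nyRows);
```

With a helper like this, a per-state scraper would reduce to a call with a different `stateName` and `stateCode` instead of a cloned file.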

+100 -0

0 comment

1 changed file

pr created time in 3 months

create branch cristipp/coronadatascraper

branch : nyt2

created branch time in 3 months

push event cristipp/coronadatascraper

Cristian Petrescu-Prahova

commit sha 95d446e514de27e7c80a16cce175b5c7d9201aba

ok

push time in 3 months

fork cristipp/coronadatascraper

COVID-19 Coronavirus data scraped from government and curated data sources.

https://coronadatascraper.com

fork in 3 months

push event cristipp/coronadatascraper

Cristian Petrescu-Prahova

commit sha 4c8529e2409ab561c805338d95d7c490dffcaf63

ok

Cristian Petrescu-Prahova

commit sha 3bc491fbaf3c4a77a9132ac92f0c6a12a98cc621

ok

push time in 3 months

fork cristipp/coronadatascraper

COVID-19 Coronavirus data scraped from government and curated data sources.

https://coronadatascraper.com

fork in 3 months

issue comment covidatlas/coronadatascraper

Death and recovered data issue

I stumbled upon https://covid19tracker.health.ny.gov/views/NYS-COVID19-Tracker/NYSDOHCOVID-19Tracker-Fatalities?%3Aembed=yes&%3Atoolbar=no&%3Atabs=n, which shows fatalities per NY county.

Karthi9934

comment created time in 3 months

started facebookexperimental/rome

started time in 3 months
