I get where you're coming from but I think you're missing the point. The issue s...

paxys · on Sept 30, 2020

As an open source maintainer, I'd gladly sift through a mountain of spammy PRs (heck closing 4 per hour, as called out in the article, is almost zero trouble), if it means even a handful of real significant progress and issues fixed and potential future maintainers.

hcho3 · on Sept 30, 2020

+1. I feel the same way. If one out of 20 drive-by contributors stick around and become regular, that would be a real win for me. (I'm currently maintaining a project with 20k GitHub stars and we have four regular contributors.)

pseudalopex · on Sept 30, 2020

That's a big if.

paxys · on Sept 30, 2020

Well this thing has been running for a few years now so I'm sure someone has that data

mundo · on Sept 30, 2020

https://github.com/MattIPv4/hacktoberfest-data

paxys · on Sept 30, 2020

A snapshot of one year's participation in that specific period isn't too relevant to what we are discussing, because it doesn't track sustained future contributions by those same users.

justinclift · on Sept 30, 2020

That's a good point. Wonder if they're be open to adding something like that...?

Maybe suggest that in an issue on that stats tracking repo?

gadgetoid · on Oct 1, 2020

In theory they are. In practise the lack of a test dataset - and the lack of access to their dataset - means it's virtually impossible for a third party to make any significant contribution to the data processing code.

Such an effort would have to start with them voluneering a test dataset and/or schema.

I have asked - https://github.com/MattIPv4/hacktoberfest-data/issues/5

mundo · on Sept 30, 2020

I'm not missing that point, I'm asking whether it's true. The right way to answer that would probably be to find out how many of the new submitters from previous years went on to continue to become valuable contributors.