Def not what you're asking, but there's thesehttps://github.com/trendinghttps://gitlab.com/explore/projects/trendingProbably have the same problem as pkgstats
Removing any package that starts with lib should get rid of most of them. If your dataset has a libs section like debian, remove those too. Then you can manually review any stragglers.
Def not what you're asking, but there's these
https://github.com/trending
https://gitlab.com/explore/projects/trending
Probably have the same problem as pkgstats