can you help collect package popularity data?

Tim Abell tim at timwise.co.uk
Sun Apr 11 11:52:56 CEST 2010



Alfa21 wrote:
> 2010-04-10 at 23:36 Tim Abell
>
>   
>> The code is at http://github.com/timabell/popularity
>> improvements would be welcome, as would inclusion in the various distros.
>> Perhaps if it works out this could be hosted by the various distro servers.
>>     
>
> nice idea...
> but it works only with opkg based distros and only with _packages_ (no manually installed tgz or compliled sources)
>
>   
i'm only interested in opkg distros (shr) at the moment. apt bases ones 
could presumably use popcon if they want this functionality. tracking 
multiple distros would be a much more ambitious project, needing more 
standardised communications, and a way of matching up varied package 
name conventions to actual software. You are welcome to have a go :-) 
One that tracks software accross *all* platforms (including windows) 
would certainly be awesome for the OSS community.

I don't think I mind missing other methods of installation, as generally 
all programs that see any decent usage *should* be packaged properly 
(it's very easy to provide an ipk, even if you don't put it any feeds). 
Besides, the itch I was scratching was for an app that *is* packaged <grin>.

I think it would be harder to figure out what has been installed from 
source & tgz, they can be too varied to track. (patches welcome though :-)
> also, it seems you do not do any check on duplicated submissions from the same user.
>
>   
indeed, I'd need a unique id for that. the trade off for that is 
privacy. not that i'm particulary against it, I just wanted "some 
numbers" as opposed to "no numbers", so wanted to get something out 
there without trying to make it perfect (and therefore never releasing it)
> so I think a human submitted simple "poll" is more useful.
>   
i think human polls have their own value, but I think you are more 
likely to get complete data from something more automated. for what I 
wanted, the programmatic approach suits best.
> fixed choices with the label of the application (imho version number is not strictly relevant)
version is part of the default output of "opkg list-installed". I 
stripped it out when processing the data to just get a count & package 
name. see http://www.timwise.co.uk/om/pop/summary20100410.txt for an 
example. Version number might be useful to see if people are running 
out-dated / insecure versions, prompting a push to get everyone upgraded.
>  and the possibility for any user to add a new slot for missing ones and when submitted these new slots are converted to fixed choices for the others.
> every user should have own identifier so it's possible to update the own list of installed stuff.
>   
it would be good for a user to have their own unique id that they could 
enter as well as a per machine id, but it would have to be optional as 
it might put people off. better data vs. privacy again I think.
> the date of submission should be collected too.
>   
the filenames contain the php date output, which presumably can be 
turned back into a meaningful value.
> and ask a captcha on submission (bots are everywhere!)
>   
I'll deal with that when I have a problem with it ;-)

Thanks for your email, it's great to bounce these ideas around.

Yours

Tim Abell



More information about the community mailing list