10 Distributions
174,054 Total Packages
3,656,731 Releases
3,040 Upstream Packages
Welcome
OpenSourceWatershed is a project aimed at understanding the relationship between distributions (downstream) and the individual software components (upstream). It is the basis for a larger study of distributions and their evolution. It is distrology. In the future, more distro oriented statistics will be available. More details are below. For now search in the top right for your favorite package to see how up to date the different distributions. Or look at the right to see what new releases happened in the last 24 hours.

The aggregate analysis is done over twenty packages including firefox, gcc and openssh. The full package list is in the OSCON slides. In the future, users will be able to set custom groups. The three forms of analysis are percent obsolete, the average number of newer releases per package and the time since the oldest new release. In other words the lag is the amount of time a distro had to move to a newer package.

There are errors in the database which you can help fix. Just email me if you find one. In the future, you will be able to fix it yourself. For more information about the process behind this analysis please read my senior thesis or email me.
Current Distros
Rank Distro Codename % Obsolete Avg # New Rels Avg Lag
1 arch 30.30% 5.88 6d
2 fedora 18 54.83% 1.32 6w
3 ubuntu raring 57.57% 12.36 12w
4 gentoo 60.60% 14.55 25w
5 sabayon 5 60.60% 15.42 8w
6 funtoo 60.60% 14.21 23w
7 freebsd 8 68.96% 22.10 52w
8 slackware 14.0 84.21% 7.05 24w
9 debian wheezy 87.5% 26.34 39w
10 opensuse 12.2 90.62% 15.00 32w
Future Distros
Rank Distro Codename % Obsolete Avg # New Rels Avg Lag
1 gentoo 27.27% 1.58 1w
2 arch 27.27% 5.85 6d
3 funtoo 27.27% 1.58 1w
4 slackware current 31.57% 0.58 2w
5 ubuntu saucy 42.42% 15.70 29w
6 opensuse 12.3 43.75% 2.91 8w
7 sabayon 5 57.57% 15.39 8w
8 fedora 19 64.51% 1.48 7w
9 freebsd 9 67.85% 20.93 35w
10 debian jessie 81.25% 15.69 32w
Future
Much more work will happen to OSWatershed in the future. The most urgent addition is distro pages which will feature data on multiple branches. User accounts are also coming in the future. Users will be able to add more data into the database and configure their own package groups to use for analysis. Ideally, spreading the work over a larger number of users (crowd-sourcing) will make the data scope more manageable. To get Scott working on these new feature email him and let him know you are waiting!

©2009 Scott Shawcroft | CC-by 3.0 US - contact - git - trac