Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015
Characteristics of Sustainable OSS Projects:

A Theoretical and Empirical Study
!
Hideaki Hata, Taiki Todo, Saya Onoue, Kenichi Matumoto
1
Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015 2
Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015
Toward Sustainable OSS
How can OSS projects attract developers?

!
What can OSS projects do to incentivize
developers to write code?
3
Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015
Context
GHTorrent datasets

• Top-10 starred software
projects for the top
programming languages on
Github: 90 projects

Filtering

• More than 3 year histories
on Dec. 2012: 22 projects
4
Gousios, MSR 2014 Mining Challenge Dataset in GHTorrent
Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015
SW Population Pyramids
• Right: coding contributors

• Left: non-coding (comments, issues) contributors
5
Onoue et al., Software population pyramids:

the current and the future of OSS development, ESEM 2014.
Experience
Developers
3 months
}
Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015
Varieties of SPP
GHTorrent datasets

• Top-10 starred software
projects for the top
programming languages on
Github: 90 projects

Filtering

• More than 3 year histories
on Dec. 2012: 22 projects
6
Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015
akka
beanstalkd
blueprint−css
compass
devise
django−cms
django−debug−toolbar
homebrew
http−parser
jekyll
jquerykestrel
MaNGOS
mongo
node
openFrameworks
paperclip
rails
redis
scalatra
ThinkUP
tornado
5
10
15
20
5 10 15 20
# of coding bars
#ofdiscussionbars
Distribution of OSS Projects
7
# of coding bars
#ofnon-codingbars
Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015
akka
beanstalkd
blueprint−css
compass
devise
django−cms
django−debug−toolbar
homebrew
http−parser
jekyll
jquerykestrel
MaNGOS
mongo
node
openFrameworks
paperclip
rails
redis
scalatra
ThinkUP
tornado
5
10
15
20
5 10 15 20
# of coding bars
#ofdiscussionbars
Distribution of OSS Projects
8
# of coding bars
#ofnon-codingbars
Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015
akka
beanstalkd
blueprint−css
compass
devise
django−cms
django−debug−toolbar
homebrew
http−parser
jekyll
jquerykestrel
MaNGOS
mongo
node
openFrameworks
paperclip
rails
redis
scalatra
ThinkUP
tornado
5
10
15
20
5 10 15 20
# of coding bars
#ofdiscussionbars
Distribution of OSS Projects
9
# of coding bars
#ofnon-codingbars
(a)
(c)
(b)
(d)
Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015
Introducing

game theory
10
Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015
Game Theoretical Model
A leader-follower game

• Project: keep (K) or setup (S)

• Developer: write code (C) or
non-coding contribution
(discussion, D)
11
developer
project
Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015
Results of Equilibrium Analysis:!
Incentivize Developers to Write Code
• Setup: To increase the utility of writing code compared to
the utility of just non-coding contributions, projects need to
setup the development environment, which can decrease
the cost of writing code.

• Mandatory: Employment is a big incentive to write code.
The project itself or other third-parties can select this
option.

• Innovation: Innovations can decrease the cost and may
increase the reward.
12
Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015
Empirical Analysis
13
Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015
Coverage of Setup
14
TABLE I
COVERAGE OF SETUP
Area Project Wiki Website How to Contribute Coding Guideline Multi-Language Document # of yes
(b)
rails no yes yes no no 2
jekyll yes yes yes no no 3
django-cms no yes no no no 1
jquery yes yes yes yes no 4
paperclip yes yes yes no yes 4
homebrew yes yes yes yes yes 5
node yes yes yes no no 3
tornade yes yes no no no 2
devise yes yes yes no no 3
redis yes yes no no no 2
openFrameworks yes yes no no no 2
compass yes yes yes yes no 4
(d)
mongo no yes yes yes no 3
akka no yes no no no 1
(a)
ThinkUP yes yes no no no 2
django-debug-toolbar yes yes no no no 2
http-parser no no no no no 0
beanstalkd yes yes no no no 2
MaNGOS no yes no no no 1
kestrel yes yes no no no 2
(c)
scalatra no yes no no no 1
blueprint-css yes yes no yes no 3
hosting service and social networking system for developers, have balanced software population pyramids with many coding
Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015
Employment
15
akka
beanstalkd
blueprint−css
compass
devise
django−cms
django−debug−toolbar
homebrew
http−parser
jekyll
jquerykestrel
MaNGOS
mongo
node
openFrameworks
paperclip
rails
redis
scalatra
ThinkUP
tornado
5
10
15
20
5 10 15 20
# of coding bars
#ofdiscussionbars
# of coding bars
#ofnon-codingbars
25 paid in 116
12 paid in 34
Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015
Innovation: Impact of GitHub
16
1 year
2 years
3 years
4 years
5 years
6 years
7 years
8 years
5 0 5
2007/12
1 year
2 years
3 years
4 years
5 years
6 years
7 years
8 years
200 100 0 100 200
2008/12
1 year
2 years
3 years
4 years
5 years
6 years
7 years
8 years
200 100 0 100 200
2009/12
1 year
2 years
3 years
4 years
5 years
6 years
7 years
8 years
200 100 0 100 200
2010/12
1 year
2 years
3 years
4 years
5 years
6 years
7 years
8 years
700 350 0 350 700
2011/12
1 year
2 years
3 years
4 years
5 years
6 years
7 years
8 years
700 350 0 350 700
2012/12
ategory coding discusscussion movedsion moveddiscussion ategory codingmovediscussioncoding discussionscussion moved
Fig. 7. The transition of software population pyramids of the rails project. This project moved to GitHub on 2008/4
C. The Impact of Innovation the analysis to a bit different situations, such as a new market
with a slight modified pricing rules and/or a social laws.
Rails moved to GitHub on August 2008
Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015
Summary
To attract and retain coding contributors

• Prepare documents (setup)

• Have paid developers (employment)

• Adopt new technologies/environment (innovation)
17
Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015
Discussions
• Limitations in theoretical analysis

• Bounded rationality. Humans are not rational in general

• Too much simplification

• Threats to validity in empirical analysis

• Limited datasets

• Analysis result may have error
18
Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015
Future Directions
• Integration of theory and empirical analysis
is strong approach for

• understanding human behaviors

• designing desirable environments
19

More Related Content

PDF
Standing out as a new grad candidate
PDF
GraphQL Summit 重點整理
PPTX
GraphQL - Missing Link In REST
PDF
Overview of Developer Bootcamps
PPTX
Demand driven Applications with GraphQL
PPTX
Everything you want to need to know about GraphQL
PDF
Introduction to Python Syntax and Semantics
PDF
How to GraphQL
Standing out as a new grad candidate
GraphQL Summit 重點整理
GraphQL - Missing Link In REST
Overview of Developer Bootcamps
Demand driven Applications with GraphQL
Everything you want to need to know about GraphQL
Introduction to Python Syntax and Semantics
How to GraphQL

Similar to Characteristics of Sustainable OSS Projects:
 A Theoretical and Empirical Study (20)

PDF
What's new in the latest source{d} releases!
PDF
QCon'17 talk: CI/CD at scale - lessons from LinkedIn and Mockito
PDF
CI/CD: Lessons from LinkedIn and Mockito
PDF
Info Session GDSC USICT
PDF
The AI UX Loop: From Tool to Experience at GitLab (2025-03-18 @ Tangível UX Day)
PPTX
WhoIsKenDeLong
PDF
Designing in the open
PDF
Flow: A living full-stack framework for the web
DOCX
DineshCV (1)
PDF
Citizen Developer Tools @ Valo Solutions / Blue Meteorite Monday session
ODP
Google summer of code 2012
PPTX
Need to reboot your content creation strategy? Start with "No"
PPTX
Citizen Developer Tools are not just for Citizen Developers (session at Share...
PPTX
Ai progress = leaderboards compute data algorithms 20180817 v3
PPTX
Maintainable Machine Learning Products
PDF
A design system. A year in review.
PPTX
AI and the Future of Devs: Nakov @ Techniverse (Nov 2024)
PPTX
Timeless TechComm Tips - STC New England Interchange Keynote 2020
PDF
Engage 2020: Six Polite Ways to Design a RESTful API for Your Application!
PDF
Resume
What's new in the latest source{d} releases!
QCon'17 talk: CI/CD at scale - lessons from LinkedIn and Mockito
CI/CD: Lessons from LinkedIn and Mockito
Info Session GDSC USICT
The AI UX Loop: From Tool to Experience at GitLab (2025-03-18 @ Tangível UX Day)
WhoIsKenDeLong
Designing in the open
Flow: A living full-stack framework for the web
DineshCV (1)
Citizen Developer Tools @ Valo Solutions / Blue Meteorite Monday session
Google summer of code 2012
Need to reboot your content creation strategy? Start with "No"
Citizen Developer Tools are not just for Citizen Developers (session at Share...
Ai progress = leaderboards compute data algorithms 20180817 v3
Maintainable Machine Learning Products
A design system. A year in review.
AI and the Future of Devs: Nakov @ Techniverse (Nov 2024)
Timeless TechComm Tips - STC New England Interchange Keynote 2020
Engage 2020: Six Polite Ways to Design a RESTful API for Your Application!
Resume
Ad

More from Hideaki Hata (7)

PDF
Same File, Different Changes: The Potential of Meta-Maintenance on GitHub
PDF
Are Donation Badges Appealing?: A Case Study of Developer Responses to Eclips...
PDF
9.6 million links in source code comments: purpose, evolution, and decay
PPTX
Understanding the Heterogeneity of Contributors in Bug Bounty Programs
PPTX
Using High-Rising Cities to Visualize Performance in Real-Time
PDF
Bug or Not? Bug Report Classification using N-Gram Idf
PDF
Bug Prediction Based on Fine-Grained Module Histories
Same File, Different Changes: The Potential of Meta-Maintenance on GitHub
Are Donation Badges Appealing?: A Case Study of Developer Responses to Eclips...
9.6 million links in source code comments: purpose, evolution, and decay
Understanding the Heterogeneity of Contributors in Bug Bounty Programs
Using High-Rising Cities to Visualize Performance in Real-Time
Bug or Not? Bug Report Classification using N-Gram Idf
Bug Prediction Based on Fine-Grained Module Histories
Ad

Recently uploaded (20)

PDF
MaterialX Virtual Town Hall - August 2025
PPTX
StacksandQueuesCLASS 12 COMPUTER SCIENCE.pptx
PDF
Top 10 Project Management Software for Small Teams in 2025.pdf
PDF
OpenColorIO Virtual Town Hall - August 2025
PPT
chapter01_java_programming_object_oriented
PPTX
Independent Consultants’ Biggest Challenges in ERP Projects – and How Apagen ...
PDF
OpenEXR Virtual Town Hall - August 2025
PPTX
oracle_ebs_12.2_project_cutoveroutage.pptx
PPTX
Advanced Heap Dump Analysis Techniques Webinar Deck
PDF
OpenTimelineIO Virtual Town Hall - August 2025
PPTX
Why 2025 Is the Best Year to Hire Software Developers in India
PPTX
Streamlining Project Management in the AV Industry with D-Tools for Zoho CRM ...
PPTX
Relevance Tuning with Genetic Algorithms
PDF
DOWNLOAD—IOBit Uninstaller Pro Crack Download Free
PDF
IObit Driver Booster Pro Crack Latest Version Download
PDF
SBOM Document Quality Guide - OpenChain SBOM Study Group
PDF
How to Write Automated Test Scripts Using Selenium.pdf
PDF
Canva Desktop App With Crack Free Download 2025?
PPTX
Beige and Black Minimalist Project Deck Presentation (1).pptx
PPTX
Comprehensive Guide to Digital Image Processing Concepts and Applications
MaterialX Virtual Town Hall - August 2025
StacksandQueuesCLASS 12 COMPUTER SCIENCE.pptx
Top 10 Project Management Software for Small Teams in 2025.pdf
OpenColorIO Virtual Town Hall - August 2025
chapter01_java_programming_object_oriented
Independent Consultants’ Biggest Challenges in ERP Projects – and How Apagen ...
OpenEXR Virtual Town Hall - August 2025
oracle_ebs_12.2_project_cutoveroutage.pptx
Advanced Heap Dump Analysis Techniques Webinar Deck
OpenTimelineIO Virtual Town Hall - August 2025
Why 2025 Is the Best Year to Hire Software Developers in India
Streamlining Project Management in the AV Industry with D-Tools for Zoho CRM ...
Relevance Tuning with Genetic Algorithms
DOWNLOAD—IOBit Uninstaller Pro Crack Download Free
IObit Driver Booster Pro Crack Latest Version Download
SBOM Document Quality Guide - OpenChain SBOM Study Group
How to Write Automated Test Scripts Using Selenium.pdf
Canva Desktop App With Crack Free Download 2025?
Beige and Black Minimalist Project Deck Presentation (1).pptx
Comprehensive Guide to Digital Image Processing Concepts and Applications

Characteristics of Sustainable OSS Projects:
 A Theoretical and Empirical Study

  • 1. Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015 Characteristics of Sustainable OSS Projects:
 A Theoretical and Empirical Study ! Hideaki Hata, Taiki Todo, Saya Onoue, Kenichi Matumoto 1
  • 2. Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015 2
  • 3. Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015 Toward Sustainable OSS How can OSS projects attract developers? ! What can OSS projects do to incentivize developers to write code? 3
  • 4. Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015 Context GHTorrent datasets • Top-10 starred software projects for the top programming languages on Github: 90 projects Filtering • More than 3 year histories on Dec. 2012: 22 projects 4 Gousios, MSR 2014 Mining Challenge Dataset in GHTorrent
  • 5. Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015 SW Population Pyramids • Right: coding contributors • Left: non-coding (comments, issues) contributors 5 Onoue et al., Software population pyramids:
 the current and the future of OSS development, ESEM 2014. Experience Developers 3 months }
  • 6. Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015 Varieties of SPP GHTorrent datasets • Top-10 starred software projects for the top programming languages on Github: 90 projects Filtering • More than 3 year histories on Dec. 2012: 22 projects 6
  • 7. Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015 akka beanstalkd blueprint−css compass devise django−cms django−debug−toolbar homebrew http−parser jekyll jquerykestrel MaNGOS mongo node openFrameworks paperclip rails redis scalatra ThinkUP tornado 5 10 15 20 5 10 15 20 # of coding bars #ofdiscussionbars Distribution of OSS Projects 7 # of coding bars #ofnon-codingbars
  • 8. Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015 akka beanstalkd blueprint−css compass devise django−cms django−debug−toolbar homebrew http−parser jekyll jquerykestrel MaNGOS mongo node openFrameworks paperclip rails redis scalatra ThinkUP tornado 5 10 15 20 5 10 15 20 # of coding bars #ofdiscussionbars Distribution of OSS Projects 8 # of coding bars #ofnon-codingbars
  • 9. Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015 akka beanstalkd blueprint−css compass devise django−cms django−debug−toolbar homebrew http−parser jekyll jquerykestrel MaNGOS mongo node openFrameworks paperclip rails redis scalatra ThinkUP tornado 5 10 15 20 5 10 15 20 # of coding bars #ofdiscussionbars Distribution of OSS Projects 9 # of coding bars #ofnon-codingbars (a) (c) (b) (d)
  • 10. Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015 Introducing
 game theory 10
  • 11. Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015 Game Theoretical Model A leader-follower game • Project: keep (K) or setup (S) • Developer: write code (C) or non-coding contribution (discussion, D) 11 developer project
  • 12. Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015 Results of Equilibrium Analysis:! Incentivize Developers to Write Code • Setup: To increase the utility of writing code compared to the utility of just non-coding contributions, projects need to setup the development environment, which can decrease the cost of writing code. • Mandatory: Employment is a big incentive to write code. The project itself or other third-parties can select this option. • Innovation: Innovations can decrease the cost and may increase the reward. 12
  • 13. Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015 Empirical Analysis 13
  • 14. Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015 Coverage of Setup 14 TABLE I COVERAGE OF SETUP Area Project Wiki Website How to Contribute Coding Guideline Multi-Language Document # of yes (b) rails no yes yes no no 2 jekyll yes yes yes no no 3 django-cms no yes no no no 1 jquery yes yes yes yes no 4 paperclip yes yes yes no yes 4 homebrew yes yes yes yes yes 5 node yes yes yes no no 3 tornade yes yes no no no 2 devise yes yes yes no no 3 redis yes yes no no no 2 openFrameworks yes yes no no no 2 compass yes yes yes yes no 4 (d) mongo no yes yes yes no 3 akka no yes no no no 1 (a) ThinkUP yes yes no no no 2 django-debug-toolbar yes yes no no no 2 http-parser no no no no no 0 beanstalkd yes yes no no no 2 MaNGOS no yes no no no 1 kestrel yes yes no no no 2 (c) scalatra no yes no no no 1 blueprint-css yes yes no yes no 3 hosting service and social networking system for developers, have balanced software population pyramids with many coding
  • 15. Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015 Employment 15 akka beanstalkd blueprint−css compass devise django−cms django−debug−toolbar homebrew http−parser jekyll jquerykestrel MaNGOS mongo node openFrameworks paperclip rails redis scalatra ThinkUP tornado 5 10 15 20 5 10 15 20 # of coding bars #ofdiscussionbars # of coding bars #ofnon-codingbars 25 paid in 116 12 paid in 34
  • 16. Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015 Innovation: Impact of GitHub 16 1 year 2 years 3 years 4 years 5 years 6 years 7 years 8 years 5 0 5 2007/12 1 year 2 years 3 years 4 years 5 years 6 years 7 years 8 years 200 100 0 100 200 2008/12 1 year 2 years 3 years 4 years 5 years 6 years 7 years 8 years 200 100 0 100 200 2009/12 1 year 2 years 3 years 4 years 5 years 6 years 7 years 8 years 200 100 0 100 200 2010/12 1 year 2 years 3 years 4 years 5 years 6 years 7 years 8 years 700 350 0 350 700 2011/12 1 year 2 years 3 years 4 years 5 years 6 years 7 years 8 years 700 350 0 350 700 2012/12 ategory coding discusscussion movedsion moveddiscussion ategory codingmovediscussioncoding discussionscussion moved Fig. 7. The transition of software population pyramids of the rails project. This project moved to GitHub on 2008/4 C. The Impact of Innovation the analysis to a bit different situations, such as a new market with a slight modified pricing rules and/or a social laws. Rails moved to GitHub on August 2008
  • 17. Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015 Summary To attract and retain coding contributors • Prepare documents (setup) • Have paid developers (employment) • Adopt new technologies/environment (innovation) 17
  • 18. Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015 Discussions • Limitations in theoretical analysis • Bounded rationality. Humans are not rational in general • Too much simplification • Threats to validity in empirical analysis • Limited datasets • Analysis result may have error 18
  • 19. Hideaki Hata, Characteristics of Sustainable OSS Projects @ CHASE 2015 Future Directions • Integration of theory and empirical analysis is strong approach for • understanding human behaviors • designing desirable environments 19