Study Notes on the Success of Online Communities
Are All Successful Communities Alike? Characterizing and Predicting the Success of Online Communities
Authors and Contact Information
Tiago Cunha, University of Michigan (tiolivei@umich.edu)
David Jurgens, University of Michigan (jurgens@umich.edu)
Chenhao Tan, University of Colorado Boulder (chenhao.tan@colorado.edu)
Daniel M. Romero, University of Michigan (drom@umich.edu)
Abstract
Growth of online communities provides opportunities to examine mechanisms explaining group success.
Most research defines community success through a single measure, mainly focusing on the number of members.
Proposal of multiple definitions of success is essential for comprehensive understanding.
Identification of four core success measures:
Growth in the number of members.
Retention of members.
Long-term survival of the community.
Volume of activities within the community.
Findings reveal low correlations among these measures, suggesting distinct types of success that require differing attributes and behaviors.
Practical implications concern the design of new online communities and how platforms facilitate these communities.
Keywords
Online Communities, Success, Group Dynamics, Reddit
Introduction
Importance of Understanding Community Success
Essential for building and maintaining vibrant online communities.
Enables user connections in shared interest areas.
Not all created communities achieve success; major datasets enable large-scale dynamic studies of community life cycles.
Problem Statement
Can we predict a community's success early based on attributes measured in its early lifetime?
Existing research often narrowly defines success based on user participation numbers.
Success metrics can vary significantly by community type (e.g., a social group versus a cause-based community).
Hypothesis
The richness and complexity of online communities necessitate a variety of success measures to understand community achievement better.
Community Success Measures
Proposed Success Measures
Growth in the number of members: Total new users within a year after the community reaches a minimum size (k).
Distinction between growth in commenters (Gcom) and posters (Gposters).
User retention: Average monthly retention rate within the first year after attaining k users (R).
Long-term survival: Percentage of activity in the last three months of a 24-month time frame after hitting k.
Volume of activities: Average number of posts and comments in the first year after reaching k members.
Measuring Community Characteristics
Formalization of measures through specific metrics involving user engagement levels for both posts and comments.
ResulCorrelation Between Success Metrics
Findings on Success Metric Correlation
Pairwise Spearman’s correlations across metrics reveal positive relationships but low correlation values.
Highlights diversity in metrics, showing that high numbers in one measure do not guarantee high values in others (e.g., growth does not necessarily indicate retention).
Specific examples illustrate varying performance across different community goals and types.
Characterizing Community Success
Defining Community Success
Emphasizes that community success varies widely between groups, influenced by nature and purpose.
A battery of studies supports the notion that successful communities may exhibit different desirable characteristics.
Detailed Success Definitions
Success measured over a fixed timeframe after k users join:
Growth:
G{com} = | igcup{i=1}^{12} U{com}^{(i)} | G{posters} = | igcup{i=1}^{12} U{posts}^{(i)} |Retention:
R = rac{1}{12} imes igg( rac{ig| U{i o i+1} ig|}{|U{all}^{(i)}|} igg)Survival:
Volume of activities:
Average posts:
rac{1}{12} imes igg( ext{Total } T_{posts}^{(i)} igg)Average comments:
rac{1}{12} imes igg( ext{Total } T_{comments}^{(i)} igg)
Implications of Findings
Communities exhibit unique success metrics; success cannot be captured through a singular definition or measurement.
The heterogeneity in community behaviors offers insights into effective strategies for community organizers.
Predictive Features of Success
Feature Classification
Six categories of features predicting success based on established research norms:
Volume and Speed of Activities: Analyzes the number of posts, comments, and activity timing to predict future growth and activity.
Distribution of Activities: Measures Gini coefficient for content distribution across community members.
User Composition: Prior user activity affects future success; considers experience levels in new members based on past statistics.
Linguistic Style: Evaluates interaction language, sentiment, and adherence to community norms related to user satisfaction.
Social Network Features: Constructs communication graphs regarding user interaction metrics.
Parent Communities: Studies the genealogical structure of communities related to early adopters.
Application of Features in Predictions
Utilized logistic regression models to interpret success measures from the set of predictive features, revealing community engagements’ influence on various success aspects.
Results and Interpretation
Prediction Accuracy
Predicted success with AUC scores ranging from 0.72 to 0.84 depending on characteristics fit for particular measures.
Growth in community commenters and user retention seen as generally easier to predict versus survival metrics.
Analysis of feature effectiveness revealed key insights into which attributes most affect community dynamics and success rates.
Design Implications for Community Organizers
Community Founding Strategies
Early active participation (even from a small core group) correlates positively with long-term success.
Recommendations to maintain steady comment and discussion rates to foster ongoing engagement and participation, preventing burnout and stagnation.
Limitations of Study
The study’s control constraints concerning community topics and their associated success patterns.
The potential gap in applying findings in real-world community-building scenarios due to the observational nature of the study.
Conclusion
Diverse pathways to community success require a comprehensive understanding of user behaviors and engagement strategies.
Core findings emphasize the important role that early community actions have in forecasting their potential longevity and success across multiple dimensions.
Acknowledgments
Supported by National Science Foundation under Grant No. IIS-1617820.
References
Extensive reference list from various in-depth studies demonstrates the breadth of research interconnecting community dynamics, success, and prediction methods.