Online Game QoE Evaluation using Paired Comparisons

[1] "Alien Arena." [Online]. Available: http://icculus.org/alienarena/rpa/aquire.html

[2] "Halo Combat Evolved." [Online]. Available: http://halo.wikia.com/wiki/Halo:_Combat_Evolved

[3] "Unreal Tournament 2004." [Online]. Available: http://www.unrealtournament2003.com/ut2004/

[4] G. Armitage, "An experimental estimation of latency sensitivity in multiplayer quake 3," in The 11th IEEE International Conference on Networks, 2003, pp. 137-141.

[5] G. Armitage and L. Stewart, "Limitations of using real-world, public servers to estimate jitter tolerance of first person shooter games," in Proceedings of ACM SIGCHI ACE 2004 Conference, 2004, pp. 257-262.

[6] N. E. Baughman and B. N. Levine, "Cheat-proof playout for centralized and distributed online games," in Proceedings of IEEE INFOCOM 2001, Anchorage, AK, Apr. 2001.

[7] T. Beigbeder, R. Coughlan, C. Lusher, J. Plunkett, E. Agu, and M. Claypool, "The effects of loss and latency on user performance in Unreal Tournament 2003," in Proceedings of NetGames'04. ACM Press, 2004, pp. 144-151.

[8] R. A. Bradley and M. E. Terry, "Rank analysis of incomplete block designs: I. the method of paired comparisons," Biometrika, vol. 39, no. 3/4, pp. 324-345, 1952.

[9] K.-T. Chen, P. Huang, and C.-L. Lei, "Effect of network quality on player departure behavior in online games," IEEE Transactions on Parallel and Distributed Systems, vol. 20, no. 5, pp. 593-606, May 2009.

[10] K.-T. Chen, C.-C. Wu, Y.-C. Chang, and C.-L. Lei, "A Crowdsourceable QoE Evaluation Framework for Multimedia Content," in Proceedings of ACM Multimedia 2009, 2009.

[11] S. Choisel and F. Wickelmaier, "Evaluation of multichannel reproduced sound: Scaling auditory attributes underlying listener preference," The Journal of the Acoustical Society of America, vol. 121, no. 1, pp. 388-400, 2007.

[12] H. A. David, The Method of Paired Comparisons. Oxford University Press, 1988.

[13] R. Dittrich, R. Hatzinger, and W. Katzenbeisser, "Modelling the effect of subject-specific covariates in paired comparison studies with an application to university rankings," Journal of the Royal Statistical Society (Series C): Applied Statistics, vol. 47, no. 4, pp. 511-525, 1998.

[14] T. Henderson, "Latency and user behaviour on a multiplayer game server," in Proceedings of the Third International COST264 Workshop on Networked Group Communication. Springer-Verlag, 2001, pp. 1-13.

[15] T. Henderson and S. Bhatti, "Modelling user behaviour in networked games," in Proceedings of the ninth ACM international conference on Multimedia. ACM, 2001, pp. 212-220.

[16] C. L. Knott and M. S. James, "An alternate approach to developing a total celebrity endorser rating model using the analytic hierarchy process," International Transactions in Operational Research, vol. 11, no. 1, pp. 87-95, 2004.

[17] R. D. Luce, Individual Choice Behavior: A Theoretical Analysis. New York: Wiley, 1959.

[18] J. N. S. Matthews and K. P. Morris, "An application of bradley-terry-type models to the measurement of pain," Applied Statistics, vol. 44, pp. 243-255, 1995.

[19] M. Oliveira and T. Henderson, "What online gamers really think of the internet?" in Proceedings of the 2nd workshop on Network and system support for games. ACM, 2003, pp. 185-193.

[20] L. Pantel and L. Wolf, "On the suitability of dead reckoning schemes for games," in Proceedings of NetGames'09, 2002, pp. 79-84.

[21] N. L. Powers and R. M. Pangborn, "Paired comparison and time-intensity measurements of the sensory properties of beverages and gelatins containing sucrose or synthetic sweeteners," Journal of Food Science, vol. 43, no. 1, pp. 41-46, 1978.

[22] P. Quax, P. Monsieurs, W. Lamotte, D. D. Vleeschauwer, and N. Degrande, "Objective and subjective evaluation of the influence of small amounts of delay and jitter on a recent first person shooter game," in Proceedings of ACM SIGCOMM 2004 workshops on NetGames '04. ACM Press, 2004, pp. 152-156.

[23] P. Rossi, Z. Gilula, and G. Allenby, "Overcoming Scale Usage Heterogeneity: A Bayesian Hierarchical Approach," Journal of the American Statistical Association, vol. 96, no. 453, pp. 20-31, 2001.

[24] T. L. Saaty, "A scaling method for priorities in hierarchical structures," Journal of Mathematical Psychology, vol. 15, no. 3, pp. 234-281, 1977.

[25] N. Sheldon, E. Girard, S. Borg, M. Claypool, and E. Agu, "The effect of latency on user performance in Warcraft III," in Proceedings of NetGames'03. ACM Press, 2003, pp. 3-14.

	T₁	T₂	T₃	T₄
T₁	-	a₁₂	a₁₃	a₁₄
T₂	a₂₁	-	a₂₃	a₂₄
T₃	a₃₁	a₃₂	-	a₃₄
T₄	a₄₁	a₄₂	a₄₃	-

Settings	# Comparisons

0 ms, 200 ms, 400 ms,	uplink	600
600 ms, 800 ms, 1000 ms	downlink	690
	uplink	288
	downlink	252
	uplink	72
	downlink	78

Game	Factor	Link	WST	MST	SST	Kendall	p-BTL

		uplink	0	2	7	0.35	0.33
		downlink	0	0	3	0.62	0.80
		uplink	0	0	1	0.40	0.12
		downlink	0	0	0	0.74	0.86
		uplink	0	1	1	-0.04	0.26
		downlink	0	0	0	1.00	0.02
		uplink	0	0	6	0.36	0.35
		downlink	2	2	5	0.53	0.15
		uplink	0	0	0	0.25	0.98
		downlink	0	1	3	0.22	0.03
		uplink	0	0	0	0.55	0.44
		downlink	0	1	1	0.32	0.05
		uplink	0	1	10	0.42	0.01
		downlink	0	2	5	0.56	0.11
		uplink	0	1	1	0.28	0.01
		downlink	0	0	2	0.43	0.17
		uplink	0	0	0	1.00	0.02
		downlink	0	0	0	-0.08	1.00

Online Game QoE Evaluation using Paired Comparisons

Abstract

1 Introduction

2 Related Work

2.1 Game QoE Studies

2.2 Studies based on Paired Comparisons

3 Probabilistic Choice Modeling

4 Experiment Methodology

5 Experiment Results

5.1 Data Summary

5.2 Effect of Network Delay

5.3 Effect of Network Loss

5.4 Effect of Delay Jitter

6 Conclusion and Future Work

References

Footnotes:

	Generalizable	Judgement difficulty	Ratio-scale scores	Input verifiable

Paired comparison	yes	low	yes	yes
MOS ratings	yes	high	no	no
Objective performance	no	N/A	no	no