Trajectory Analysis for User Verification and Recognition

[1] Abdul Rahim Ahmad, M. Khalid, C. Viard-Gaudin, and E. Poisson. Online handwriting recognition using support vector machine. In Second International Conference on Artificial Intelligence in Engineering and Technology, volume A, pages 311-314, Nov. 2004.

[2] Kuan-Ta Chen, Andrew Liao, Hsing-Kuo Pao, and Hao-Hua Chu. Game Bot Detection Based on Avatar Trajectory. In Proceedings of IFIP ICEC, pages 94-105, 2008.

[3] Kuan-Ta Chen, Hsing-Kuo Pao, and Hong-Chung Chang. Game Bot Identification based on Manifold Learning. In Proceedings of ACM NetGames, pages 21-26, October 2008.

[4] Trevor F. Cox and Michael A. A. Cox. Multidimensional Scaling, Second Edition. Chapman & Hall/CRC, 2000.

[5] Hamido Fujita, Jun Hakura, and Masaki Kurematu. Intelligent human interface based on mental cloning-based software. Know.-Based Syst., 22:216-234, April 2009.

[6] John S. Gero and Wei Peng. Understanding behaviors of a constructive memory agent: A markov chain analysis. Know.-Based Syst., 22:610-621, December 2009.

[7] Steven Gianvecchio, Zhenyu Wu, Mengjun Xie, and Haining Wang. Battle of botcraft: fighting bots in online games with human observational proofs. In Ehab Al-Shaer, Somesh Jha, and Angelos D. Keromytis, editors, ACM Conference on Computer and Communications Security, pages 256-268. ACM, 2009.

[8] Chien-Ju Ho, Chen-Chi Wu, Kuan-Ta Chen, and Chin-Laung Lei. DevilTyper: A Game for CAPTCHA Usability Evaluation. ACM Computers in Entertainment, 9(1):3:1-3:14, April 2011.

[9] Anil K. Jain, Patrick Flynn, and Arun A. Ross. Handbook of Biometrics. Springer-Verlag New York, Inc., Secaucus, NJ, USA, 2007.

[10] Anil K. Jain, Friederike D. Griess, and Scott D. Connell. On-line signature verification. Pattern Recognition, 35(12):2963-2972, 2002.

[11] Eamonn Keogh, Stefano Lonardi, and Chotirat Ann Ratanamahatana. Towards parameter-free data mining. In KDD '04: Proceedings of the tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 206-215, New York, NY, USA, 2004. ACM.

[12] Jae-Gil Lee, Jiawei Han, Xiaolei Li, and Hector Gonzalez. TraClass: trajectory classification using hierarchical region-based and trajectory-based clustering. PVLDB, 1(1):1081-1094, 2008.

[13] Yuh-Jye Lee and O. L. Mangasarian. SSVM: A smooth support vector machine for classification. Comput. Optim. Appl., 20(1):5-22, 2001.

[14] M. Li, J. H. Badger, X. Chen, S. Kwong, P. Kearney, and H. Zhang. An information-based sequence distance and its application to whole mitochondrial genome phylogeny. Bioinformatics, 17(2):149-154, 2001.

[15] M. Li and P. Vitányi. An Introduction to Kolmogorov Complexity and Its Applications (2nd Ed.). Springer, New York, 1997.

[16] Fengyi Lin, Ching-Chiang Yeh, and Meng-Yuan Lee. The use of hybrid manifold learning and support vector machines in the prediction of business failure. Know.-Based Syst., 24(1):95-101, 2011.

[17] Jessica Lin, Eamonn J. Keogh, Stefano Lonardi, and Bill Yuan-Chi Chiu. A symbolic representation of time series, with implications for streaming algorithms. In DMKD, pages 2-11, 2003.

[18] Pietro Liò and Nick Goldman. Models of molecular evolution and phylogeny. Genome Res, 8:1233-1244, 1998.

[19] Ronald Metoyer, Simone Stumpf, Christoph Neumann, Jonathan Dodge, Jill Cao, and Aaron Schnabel. Explaining how to play real-time strategy games. Know.-Based Syst., 23:295-301, May 2010.

[20] Greg Mori and Jitendra Malik. Recognizing objects in adversarial clutter: Breaking a visual CAPTCHA. In Computer Vision and Pattern Recognition, volume 1, pages 134-141, Los Alamitos, CA, USA, 2003. IEEE Computer Society.

[21] Mario E. Munich and Pietro Perona. Visual identification by signature tracking. IEEE Trans. Pattern Anal. Mach. Intell., 25(2):200-217, 2003.

[22] Hsing-Kuo Pao and John Case. Computing entropy for ortholog detection. In International Conference on Computational Intelligence, pages 89-92, 2004.

[23] Hsing-Kuo Pao, Kuan-Ta Chen, and Hong-Chung Chang. Game bot detection via avatar trajectories analysis. IEEE Transactions on Computational Intelligence and AI in Games, 2(3):162-175, September 2010.

[24] Hsing-Kuo Pao, Hong-Yi Lin, Kuan-Ta Chen, and Junaidillah Fadlil. Trajectory based Behavior Analysis for User Verification. In IDEAL, pages 315-322, 2010.

[25] Yu Qiao, Jianzhuang Liu, and Xiaoou Tang. Offline signature verification using online handwriting registration. In CVPR, 2007.

[26] Jonas Richiardi and Andrzej Drygajlo. Gaussian mixture models for on-line signature verification. In WBMA '03: Proceedings of the 2003 ACM SIGMM workshop on Biometrics methods and applications, pages 115-122, New York, NY, USA, 2003. ACM.

[27] Bruce Schneier. Two-factor authentication: too little, too late. Commun. ACM, 48:136, April 2005.

[28] C. E. Shannon. A mathematical theory of communication. Bell Syst. Tech. J., 27:379-423, 1948.

[29] J. B. Tenenbaum, V. de Silva, and J. C. Langford. A global geometric framework for nonlinear dimensionality reduction. Science, 290(5500):2319-2323, December 2000.

[30] Luis von Ahn, Manuel Blum, Nicholas J. Hopper, and John Langford. CAPTCHA: Using hard AI problems for security. In EUROCRYPT, pages 294-311, 2003.

Footnotes:

1. We use equally spaced time stamps in this work; in practice, we sample one point per second.

2. It will be discussed in Section 4.

3. Motion pattern detection usually requires dedicated devices. For instance, Fujita et al. [5] need devices to capture emotional states from facial and voice patterns.

4. This work studies more types of trajectories, and studies them more systematically, than the work in [24]; moreover, we clarify the difference between the verification and recognition tasks and examine the recognition task in more depth.

5. The Markov chain (MC) model is popular in many learning tasks; for example, Shannon [28] uses it to model English text; Liò and Goldman [18] use it to model DNA base substitution and amino acid replacement for phylogenetic reconstruction; and Gero and Peng [6] use it to model constructive memory.

6. The term $p(x_1)$ is a prior that assumes a uniform distribution, so it can be ignored in the computation of the maximum likelihood. We also assume that $p(x_2 \mid x_1)$ is a 2-D isotropic Gaussian $\frac{1}{\sqrt{2\pi}\,\sigma_\lambda}\exp\left\{-\frac{(\lambda_1-\bar{\lambda})^2}{2\sigma_\lambda^2}\right\}$, centered at the origin, where $\bar{\lambda}$ is the mean of the step size $\lambda_t$ in the trajectory.

7. The smaller the value, the closer the relationship between s1 and s2.

8. The concatenation order, such as producing s12 by concatenating s1 followed by s2, or vice versa, makes very little difference. Only the concatenation point makes a difference.

9. The distribution is in a discrete form when we collect the data. Here, discretization means that we combine several bins to form one group or we split bins into several smaller bins, depending on the discretization parameter. We use capital P to denote that it is a probability mass function for the entropy computation.

10. In this work, the input trajectories whose dissimilarities we compute always have the same length.

11. http://www.idsoftware.com/

12. http://arton.cunst.net/quake/crbot/

13. R. R. Feltrin. Eraser bot 1.01

14. http://ice.planetquake.gamespy.com/

15. http://www.cse.ust.hk/svc2004/download.html

16. http://www.fs.fed.us/pnw/starkey/index.shtml. Wisdom, Michael J. 1988. "The Starkey Project: deer and elk research for the future". Oregon Chapter, The Wildlife Society, Pendleton, OR., U.S.A.

17. In the SSVM classification, we adopt a hierarchical approach to solve the multi-class classification problem. The elk and deer are combined to form one group for the first binary classification, followed by another binary classification to separate them.

18. Identifying the true account owner based purely on the trajectory input, without any other information, is difficult, especially when the database holds a large number of individuals, e.g., thousands or millions. The proposed method nevertheless provides a possible solution.
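
As an illustration of footnote 6, the following minimal sketch evaluates the logarithm of the Gaussian term at the first step size λ1. It assumes that the trajectory is a list of 2-D points, that the step size is the Euclidean distance between consecutive points, and that σλ is estimated as the population standard deviation of the step sizes; the footnote does not state how σλ is obtained, so that choice, like the function names, is an assumption made only for this example.

```python
import math


def step_sizes(points):
    """Euclidean step sizes lambda_t between consecutive 2-D points."""
    return [math.dist(p, q) for p, q in zip(points, points[1:])]


def first_step_log_density(points):
    """Log of the Gaussian term in footnote 6, evaluated at lambda_1.

    Assumption: sigma_lambda is the population standard deviation of
    the step sizes of this same trajectory.
    """
    lam = step_sizes(points)
    if not lam:
        raise ValueError("need at least two points")
    mean = sum(lam) / len(lam)                        # lambda-bar
    var = sum((x - mean) ** 2 for x in lam) / len(lam)
    if var == 0.0:
        raise ValueError("step sizes are constant; sigma_lambda is zero")
    # log of 1 / (sqrt(2*pi) * sigma) * exp(-(lam_1 - mean)^2 / (2 * sigma^2))
    return -0.5 * math.log(2.0 * math.pi * var) - (lam[0] - mean) ** 2 / (2.0 * var)


if __name__ == "__main__":
    trajectory = [(0, 0), (1, 0), (1, 2), (4, 2)]
    print(step_sizes(trajectory))
    print(first_step_log_density(trajectory))
```

Working in the log domain avoids underflow when such terms are multiplied along a long trajectory; the uniform prior p(x1) is omitted because it is constant across trajectories.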
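
The discretization described in footnote 9 can be sketched as follows. The example groups observed values into a chosen number of equal-width bins, forms the probability mass function P from the bin counts, and computes its Shannon entropy in bits. The equal-width binning, the explicit range arguments, and the function name are assumptions for illustration; the footnote only says that bins are merged or split according to a discretization parameter.

```python
import math
from collections import Counter


def discretized_entropy(values, num_bins, lo, hi):
    """Shannon entropy (bits) of values binned into num_bins groups on [lo, hi].

    Changing num_bins plays the role of the discretization parameter:
    fewer bins merge neighbouring groups, more bins split them.
    """
    if num_bins < 1 or hi <= lo:
        raise ValueError("need num_bins >= 1 and hi > lo")
    if not values:
        raise ValueError("need at least one value")
    width = (hi - lo) / num_bins
    # Clamp indices so that out-of-range values fall in the edge bins.
    counts = Counter(
        max(0, min(int((v - lo) / width), num_bins - 1)) for v in values
    )
    n = len(values)
    # P(bin) = count / n; entropy = -sum P log2 P over non-empty bins.
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


if __name__ == "__main__":
    data = [0.1, 0.2, 0.6, 0.9]
    for bins in (1, 2, 4):
        print(bins, discretized_entropy(data, bins, 0.0, 1.0))
```

Coarser discretization can only merge probability mass, so the entropy never increases as neighbouring bins are combined.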

Sheng-Wei Chen (also known as Kuan-Ta Chen)
http://www.iis.sinica.edu.tw/~swc
Last Update September 28, 2019


[Figure panels: (a) On-line game: human; (b) On-line game: bot; (c) Mouse trace: a left-handed user; (d) Mouse trace: a right-handed user]

[Figure panels: (a1) t=800; (a2) t=500; (a3) t=300; (b); (c); (d)]

Name	Classes	Instances	Trace Length	k_Iso	Intrinsic Dim.	Length Threshold
Handwriting	11	110	702	7	5	200
Mouse	14	217	16665	6	5	300
Game_v	94	940	1000	8	5	300
Game_r	4	173	1000	5	5	300
Animal	3	101	145	5	5	160

Data Set	Training Error	Test Error
Handwriting	1.18	3.91
Mouse	6.67	8.10
Game	10.80	15.62

Data Set	Position	Angle	SC	OC	QLT	Ours (Training)	Ours (Test)
Set 1	13.6	6.5	7.2	5.8	7.3	3.29	4.21
Set 2	11.9	6.3	4.9	4.6	7.4	2.42	4.93


Abstract

1 Introduction

2 Related Work

2.1 Account Security

2.1.1 Traditional Approach

2.1.2 Handwritten Signature and Other Biometrics

2.1.3 CAPTCHA

2.2 Trajectory Analysis

3 Proposed Method

3.1 Feature Extraction

3.2 Dissimilarity Measures

3.3 Preprocessing by Partition and Alignment

3.3.1 Trajectory Partition

3.3.2 Trajectory Alignment

3.4 Trajectory Representation and Labeling

4 Experiments

4.1 Data Description

4.1.1 Game Trajectory

4.1.2 Handwritten Signature

4.1.3 Mouse Movement Trajectory

4.1.4 Animal Trajectory

4.2 Verification

Using Trajectories of Different Length

4.3 Recognition

4.3.1 Game Trajectory

4.3.2 Animal Trajectory

4.4 Computation Time

5 Conclusion

References


Trace Length	Training Error	Test Error
500 seconds	5.29	7.07
1000 seconds	1.89	2.53

Data Set	Our Method Training Error	Our Method Test Error	TraClass Test Error
Animal	7.53	12.51	16.70