Parameter Estimation of Conditional Random Fields Model By Improved Particle Swarm Optimizer

doi:10.4304/jcp.6.8.1628-1633

Journal of Computers, Vol 6, No 8 (2011), 1628-1633, Aug 2011

doi:10.4304/jcp.6.8.1628-1633

Parameter Estimation of Conditional Random Fields Model By Improved Particle Swarm Optimizer

Zengfa Dou, Lin Gao

Abstract

A new parameter estimation algorithm based on improved particle swarm optimizer is proposed to improve the precision and recall rate of conditional random fields model. Aggregation degree of particle swarm is utilized to control particle swarm optimizer’s early local convergence, the relative change ratio of log-likelihood between iterations is employed to end its iterations, and the inertia factor and learning factor are set as linear variables to control the searching scope. We evaluate our method on GENIA, GENETAG and private library. The experiment results prove our method outperforms traditional parameter estimation method on precision and recall.

Keywords

Conditional Random Fields Model; Particle Swarm Optimizer; Parameter Estimation; Aggregation degree of particle swarm; Relative change ratio of log-likelihood

References

[1] AR Kinjo, F Rossello, G Valiente, Profile Conditional Random Fields for Modeling Protein Families with Structural Information, BIOPHYSICS, Vol 5,2009, pp.37-44.
http://dx.doi.org/10.2142/biophysics.5.37

[2] Brill E, Transformation-based errordriven learning and natural language processing: A case study in part-of-speech tagging, Comput. Linguistics, Vol. 21(4), pp.543–565.

[3] Brill E, Processing natural language without natural language processing, in Computational Linguistics and Intelligent Text Processing, Proceedings, Lecture Notes in Computer Science, Vol. 2588, Gelbukh, A. F., Ed, Springer, pp. 360–369.
http://dx.doi.org/10.1007/3-540-36456-0_37

[4] Brill E and Mooney RJ, An overview of empirical natural language processing, AI Magazine, Vol. 18(4), pp. 13–24.

[5] B Settles, Biomedical named entity recognition using conditional random fields and novel feature sets, In Proceedings of the Joint Workshop on Natural Language Processing in Biomedicine and its Applications (JNLPBA-2004), pp. 104–107.

[6] Chang JT, Schutze H and Altman RB, GAPSCORE: Finding gene and protein names one word at a time, Bioinformatics, Vol. 20(2), 2004, pp. 216–225.
http://dx.doi.org/10.1093/bioinformatics/btg393
PMid:14734313

[7] Charles A Sutton, Efficient Training Methods For Conditional Random Fields, PhD thesis, University of Massachusetts Amherst, February 2008.

[8] Chen L and Friedman C,Extracting phenotypic information from the literature via natural language processing, in Proceedings of the 11th World Congress on Medical Informatics, IMIA, San Francisco, CA, 2004,pp. 758–762.

[9] Chris Pal, Charles Sutton, and Andrew McCallum, Sparse forward backward using minimum divergence beams for fast training of conditional random Fields, In International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2006,pp.v-v.

[10] C Sutton, A McCallum, Piecewise pseudolikelihood for efficient training of conditional random fields, Proceedings of the 24th international conference on Machine learning, 2007,pp. 863-870.

[11] Erik F, Tjong Kim Sang and Fien De Meulder, Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity Recognition, In Walter Daelemans and Miles Osborne, editors, Proceedings of CoNLL-2003, Edmonton, Canada, 2003.

[12] Fletcher R and C M Reeves, Function Minimization by Conjugate Gradients, Comp.J, 7,1964,pp. 149-154.
http://dx.doi.org/10.1093/comjnl/7.2.149

[13] Hanisch D, Fluck J, Mevissen HT and Zimmer R, Playing biology’s name game: Identifying protein names in scientific text, in Proceedings of the 8th Pacific Symposium on Biocomputing, 3rd–7th January,2003, Hawaii, pp. 403–414.

[14] Han-Shen Huang, Yu-Ming Chang, Chun-Nan Hsu, Training Conditional Random Fields by Periodic Step Size Adaptation for Large-Scale Text Mining, Seventh IEEE International Conference on Data Mining, 2007,pp. 511-516.

[15] J Lafferty, A McCallum, F Pereira, Conditional random fields: Probabilistic models for segmenting and labeling sequence data, Proc. 18th International Conf On Machine Learning, pp. 282-289.

[16] J Kennedy and RC Eberhart, Particle Swarm Optimization, Proc. on feedback mechanism IEEE Int’l. Conf. on Neural Networks, vol. VI, IEEE Service Center, 1995,pp. 1942-1948.

[17] Kim JD, Ohta T, Tateisi Y and Tsujii J, GENIA corpus – a semantically annotated corpus for bio-textmining, Bioinformatics, Vol. 19, Suppl. 1, 2003, pp. i180–182.
http://dx.doi.org/10.1093/bioinformatics/btg1023
PMid:12855455

[18] Lin Liao, Tanzeem Choudhury, Dieter Fox, Henry Kautz, Training Conditional Random Fields using Virtual Evidence Boosting, In Proc. of the International Joint Conference on Artificial Intelligence (IJCAI), 2007,pp.2530-2535.

[19] Lovbjerg M, Rasmussen T K, Krink T, Hybrid particle swarm optimization with breeding and subpopulations, Proc of the third Genetic and Evolutionary computation conference, San Francisco, USA, 200l,pp. 469-476.

[20] M Al-Baali, Descent property and global convergence of the Fletcher-Reeves method with inexact line search, IMA Journal of Numerical Analysis, Vol 5,1985, pp.121-124.
http://dx.doi.org/10.1093/imanum/5.1.121

[21] M Bundschus, M Dejori, M Stetter, V Tresp, HP Kriegel, Extraction of semantic biomedical relations from text using conditional random fields, BMC Bioinformatics 9 (2008): 207.
http://dx.doi.org/10.1186/1471-2105-9-207
PMid:18433469 PMCid:2386138

[22] Mika S and Rost B, Protein names precisely peeled off free text, Bioinformatics, Vol. 20, Suppl. 1, 2004, pp. I241–247.
http://dx.doi.org/10.1093/bioinformatics/bth904
PMid:15262805

[23] M Mahdaviani, T Choudhury, Fast and Scalable Training of Semi-Supervised CRFs with Application to Activity Recognition, Advances in Neural Information Processing Systems, 2007

[24] Narayanaswamy M, Ravikumar KE and Vijay-Shanker K, A biological named entity recognizer, in Proceedings of the 8th Pacific Symposium on Biocomputing, 3rd–7th January 2003, Hawaii, pp. 427–438.

[25] R Eberhart, J Kennedy, A new optimizer using particle swarm theory, Proc. 6th Int. Symposium on Micro Machine and Human Science, IEEE, 1995,pp. 39-43.

[26] Richard H Byrd, Peihuang Lu, Jorge Nocedal and Ciyou Zhu, A Limited Memory Algorithm For Bound Constrained Optimization, SIAM Journal on Scientific Computing, Volume 16, 1995, pp. 1190-1208.

[27] R Malouf, A comparison of algorithms for maximum entropy parameter estimation, In Proceedings of The Sixth Conference on Natural Language Learning (CoNLL-2002), 2002,pp. 49–55.

[28] Settles B, Biomedical named entity recognition using conditional random fields and rich feature sets, in Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications (NLPBA), Geneva, Switzerland, 2004.

[29] SS Gross, O Russakovsky, CB Do, S Batzoglou, Training Conditional Random Fields for Maximum Labelwise Accuracy, Advances in Neural Information Processing Systems, 2007,pp. 529-536.

[30] SVN Vishwanathan, NN Schraudolph, MW Schmidt, KP Murphy, Accelerated training of conditional random fields with stochastic gradient methods, Proceedings of the 23 rd International Conference on Machine Learning, Pittsburgh, PA, 2006,pp. 969-976.

[31] Tanabe L and Wilbur WJ, Tagging gene and protein names in biomedical text, Bioinformatics, Vol. 18(8), 2002, pp. 1124–1132.
http://dx.doi.org/10.1093/bioinformatics/18.8.1124
PMid:12176836

[32] Thomas G Dietterich, Guohua Hao, Adam Ashenfelter, Gradient Tree Boosting for Training Conditional Random Fields, Journal of Machine Learning Research 9, 2008, pp.2113-2139.

[33] Van den Bergh F Engelbrecht AP, Training product unit networks using cooperative particle swarm optimizers, Proc of the third Genetic and Evolutionary computation conference, San Francisco, USA, 200l,pp. 126-131.

[34] Xiaoxu Ma, W Eric, L Grimson, Learning Coupled Conditional Random Field for Image Decomposition with Application on Object Categorization, IEEE Conference on Computer Vision and Pattern Recognition, June 2008,pp. 1-8.
http://dx.doi.org/10.1109/CVPR.2008.4587593

[35] Y Guangyou. A modifed particle swarm optimizer algorithm. 8th International Conference on Electronic Measurement and Instruments, ICEMI '07,2007, pp.2-675-2-679

[36] YH Dai and Y Yuan, Convergence properties of the Fletcher-Reeves method, IMA Journal of Numerical Analysis 16, 1996, pp. 155-164.
http://dx.doi.org/10.1093/imanum/16.2.155

[37] Y Hifny, S Renals, Speech Recognition using Augmented Conditional Random Fields, IEEE Transactions on Audio, Speech, and Language Processing, Volume17, 2009,pp. 354-365.
http://dx.doi.org/10.1109/TASL.2008.2010286

[38] Y Xiong, J Zhu, H Huang, H Xu, Minimum tag error for discriminative training of conditional random fields, Information Sciences, 2009,pp. 169-179.
http://dx.doi.org/10.1016/j.ins.2008.09.018

[39] Zhou G, Zhang J, Su J et al, Recognizing names in biomedical texts: A machine learning approach, Bioinformatics, Vol. 20(7), 2004,pp. 1178–1190.
http://dx.doi.org/10.1093/bioinformatics/bth060
PMid:14871877

Full Text: PDF

Journal of Computers (JCP, ISSN 1796-203X)

Username
Password
Remember me