Improving Text Mining Methods in Market Prediction via Prototype Selection Algorithms

Niknam, Farzad; Niknafs, Aliakbar

doi:10.22059/jitm.2016.57234

فهرست نشریات

دوره ویراستاری

دوره آموزشی بهره‌گیری کاربردی از هوش مصنوعی در نگارش، داوری، ویرایش و انتشار آثار علمی برگزار می‌شود

دوره تخصصی آموزش ویراستاری برگزار می‌شود.

فهرست نشریات دارای اعتبار وزارت علوم، تحقیقات و فناوری

فهرست مجلات علمی- پژوهشی دانشگاه تهران

نحوه ارسال مقاله برای مجله- ثبت نام در سامانه- فراموش کردن رمز عبور

تعداد نشریات	127
تعداد شماره‌ها	7,182
تعداد مقالات	77,149
تعداد مشاهده مقاله	156,476,737
تعداد دریافت فایل اصل مقاله	118,028,688

	Improving Text Mining Methods in Market Prediction via Prototype Selection Algorithms
Journal of Information Technology Management
مقاله 74، دوره 8، شماره 2، مهر 2016، صفحه 415-434 اصل مقاله (403.27 K)
نوع مقاله: Research Paper
شناسه دیجیتال (DOI): 10.22059/jitm.2016.57234
نویسندگان
Farzad Niknam^* ¹؛ Aliakbar Niknafs²
¹MSc. Student, Department of Computer Engineering, Faculty Engineering, Shahid Bahonar University of Kerman, Kerman, Iran
²Assistant Prof., Department of Computer Engineering, Faculty Engineering, Shahid Bahonar University of Kerman, Kerman, Iran
چکیده
Nowadays, researches are faced with large volumes of data. Since a considerable amount of them are unstructured, they cannot be processed naturally. Hence two main challenges in this field are high dimensional of features space and bulk of available data. In this research, a feature selection method based on target features is propose to handle the curse of dimensionality. Moreover, to address the huge volume of data some of prototype selection approaches are utilized. The proposed method in this paper has three essential steps that each step improves the previous ones. Although, the proposed method reached significant results in each phase separately, its best performance obtained via the last phase in terms of classification accuracy rate. To evaluate the performance of the proposed method, it has been compared with three-layer algorithm. The results revealed that the proposed method had significantly better results than the three-layer algorithm in average.
کلیدواژه‌ها
Prototype Selection؛ Market Prediction؛ Text Classification؛ Text mining
عنوان مقاله [English]
بهبود روش‌های متن‌کاوی در کاربرد پیش‌بینی بازار با استفاده از الگوریتم‌های انتخاب نمونۀ اولیه
نویسندگان [English]
فرزاد نیکنام¹؛ علی اکبر نیک نفس²
¹دانشجوی کارشناسی ارشد مهندسی کامپیوتر، دانشکدۀ فنی و مهندسی، دانشگاه شهید باهنر کرمان، کرمان، ایران
²استادیار بخش مهندسی کامپیوتر، دانشکدۀ فنی و مهندسی دانشگاه شهید باهنر کرمان، کرمان، ایران
چکیده [English]
امروزه محققان با حجم وسیعی از داده‌ مواجه‎اند که بخش زیادی از آنها ساختار پردازش‎پذیری ندارند. دو مورد از چالش‌های اصلی در این زمینه بالا بودن ابعاد فضای ویژگی و حجیم بودن داده‌های در دسترس است. به‌منظور رفع این چالش‌ها، مقالۀ پیش رو یک روش انتخاب ویژگی مبتنی بر ویژگی‌های هدف ارائه کرده است که در کاهش ابعاد فضای ویژگی تأثیر زیادی دارد و همچنین برای مقابله با حجم بسیار زیاد نمونه‌های آموزش، با استفاده از روش‌های انتخاب نمونۀ اولیه، به ویرایش مجموعۀ آموزش می‎پردازد. روش پیشنهادی در این مقاله در سه فاز اجرا شده است که هر فاز بهبودیافتۀ فاز قبل است و علاوه‎بر دست‎یافتن به نتایج مناسب در هر فاز، در پایان فاز سوم روش پیشنهادی بیشترین کارایی را به‎دست آورد. برای ارزیابی کارایی روش پیشنهادی، این روش با یکی از الگوریتمهای موفق در زمینۀ پیش‌بینی بازار مقایسه شد که با وجود کاهش نمونه‌های آموزش توسط الگوریتم‌های انتخاب نمونۀ اولیه، به نتایج بسیار بهتری نسبت به آن الگوریتم دست یافت.
کلیدواژه‌ها [English]
انتخاب نمونۀ اولیه, پیش‌بینی بازار, طبقه‌بندی متن, متن‌کاوی

مراجع
Aggarwal, C. C. & Zhai, C. (2012). Mining text data. Springer Science & Business Media. ISBN: 978-1-4614-3222-7 (Print) 978-1-4614-3223-4. (Online) Aghabozorgi, S. (2016). Big Data Mining. Retrieved January 09, 2016, from https://sites.google.com/site/bigdatasetmining/Projects/textmining. de Fortuny, E. J., De Smedt, T., Martens, D. & Daelemans, W. (2014). Evaluating and understanding text-based stock price prediction models. Information Processing & Management, 50(2): 426-441. Garcia, S., Derrac, J., Cano, J. R., & Herrera, F. (2012). Prototype selection for nearest neighbor classification: Taxonomy and empirical study. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 34(3), 417-435. García, S., Luengo, J. & Herrera, F. (2015). Data preprocessing in data mining. Switzerland: Springer. Hagenau, M., Liebmann, M., & Neumann, D. (2013). Automated news reading: Stock price prediction based on financial news using context-capturing features. Decision Support Systems, 55(3): 685-697. Huang, C.-J., Liao, J.-J., Yang, D.-X., Chang, T.-Y., & Luo, Y.-C. (2010). Realization of a news dissemination agent based on weighted association rules and text mining techniques. Expert Systems with Applications, 37(9): 6409-6413. Im, T. L., San, P. W., On, C. K., Alfred, R., & Anthony, P. (2014). Impact of Financial News Headline and Content to Market Sentiment. International Journal of Machine Learning and Computing, 4(3): 237-242. Jivani, A. G. (2011). A comparative study of stemming algorithms. Int. J. Computer Technology and Applications, 2(6): 1930-1938. Kim, Y., Jeong, S. R., & Ghani, I. (2014). Text opinion mining to analyze news for stock market prediction. Int. J. Advances in Soft Computing and its Applications, 6(1): 1-13. Manning, C. D., Raghavan, P. & Schütze, H. (2008). Introduction to information retrieval (Vol. 1): Cambridge university press Cambridge. Murty, M. N. & Devi, V. S. (2011). Pattern recognition: An algorithmic approach. India : Springer Science & Business Media. Nassirtoussi, A. K., Aghabozorgi, S., Wah, T. Y. & Ngo, D. C. L. (2014). Text mining for market prediction: A systematic review. Expert Systems with Applications, 41(16): 7653-7670. Nassirtoussi, A. K., Aghabozorgi, S., Wah, T. Y. & Ngo, D. C. L. (2015). Text mining of news-headlines for FOREX market prediction: A Multi-layer Dimension Reduction Algorithm with semantics and sentiment. Expert Systems with Applications, 42(1): 306-324. Nikfarjam, A., Emadzadeh, E. & Muthaiyah, S. (2010). Text mining approaches for stock market prediction. Paper presented at the Computer and Automation Engineering (ICCAE). 2010 The 2nd International Conference on. 26-28 Feb. 2010: Singapore Niknam, F. & Niknafs, A. (2015). Using Training Set Selection Methods to Improve Text Mining on Market Prediction via News Headlines. Paper presented at the The International Congress on Technology, Communication and Knowledge, Mashhad, Iran. (in Persian) Passini, C., Luiza, M., Estébanez, K., Figueredo, G., Ebecken, F. & Nelson, F. (2013). A strategy for training set selection in text classification problems. International Journal of Advanced Computer Science & Applications, 4(6): 54-60. Weiss, S. M., Indurkhya, N. & Zhang, T. (2010). Fundamentals of predictive text mining: Springer Science & Business Media. Yang, J., Liu, Y., Zhu, X., Liu, Z., & Zhang, X. (2012). A new feature selection based on comprehensive measurement both in inter-category and intra-category for text categorization. Information Processing & Management, 48(4): 741-754.
آمار تعداد مشاهده مقاله: 2,293 تعداد دریافت فایل اصل مقاله: 1,324

سامانه مدیریت نشریات علمی. قدرت گرفته از سیناوب

پیوندهای مفید

پیوندهای مفید

اخبار و اعلانات

آمار

Improving Text Mining Methods in Market Prediction via Prototype Selection Algorithms