TY - STD TI - Hurtado J, Huang S, Zhu X. Topic discovery and future trend prediction using association analysis and ensemble forecasting. In: the 16th IEEE international conference on information reuse and integration. San Francisco, CA: 2015. ID - ref1 ER - TY - STD TI - Mei Q, Zhai C. Discovering evolutionary theme patterns from text: an exploration of temporal text mining. In: Proceedings of the eleventh ACM SIGKDD international conference on knowledge discovery in data mining. 2005. ID - ref2 ER - TY - STD TI - Berlanga-Llavori R, Anaya-Sánchez H, Pons-Porrata A, Jiménez-Ruiz E. Conceptual subtopic identification in the medical domain. In: Geffner H, Prada R, Machado Alexandre I, David N, editors. Advances in artificial intelligence—IBERAMIA 2008. Lecture notes in computer science, vol 5290. Springer; 2008. p. 312–21. ID - ref3 ER - TY - STD TI - Mörchen F, Dejori M, Fradkin D, Etienne J, Wachmann B, Bundschus M. Anticipating annotations and emerging trends in biomedical literature. In: Proc. of ACM SIG KDD conference. 2008. ID - ref4 ER - TY - STD TI - Tucker C, Kim H. Predicting emerging product design trend by mining publicly available customer review data. In: Proc. of international conference on engineering design. 2011. ID - ref5 ER - TY - JOUR AU - Schumaker, R. AU - Chen, H. PY - 2012 DA - 2012// TI - Textual analysis of stock market prediction using breaking financial news: The azfin text system JO - ACM Trans Inf Syst. VL - 27 UR - https://doi.org/10.1145/1462198.1462204 DO - 10.1145/1462198.1462204 ID - Schumaker2012 ER - TY - JOUR AU - Newman, D. J. AU - Block, S. PY - 2006 DA - 2006// TI - Probabilistic topic decomposition of an eighteenth-century american newspaper JO - J Am Soc Inf Sci Technol VL - 57 UR - https://doi.org/10.1002/asi.20342 DO - 10.1002/asi.20342 ID - Newman2006 ER - TY - JOUR AU - Blei, D. M. PY - 2012 DA - 2012// TI - Introduction to probabilistic topic models JO - Commun ACM VL - 55 UR - https://doi.org/10.1145/2133806.2133826 DO - 10.1145/2133806.2133826 ID - Blei2012 ER - TY - STD TI - Fu T-c. A review on time series data mining. Eng Appl Artif Intell. 2011;24(1):164–81. ID - ref9 ER - TY - STD TI - Wang X, McCallum A. Topics over time: a non-markov continuous-time model of topical trends. In: Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining. 2006. ID - ref10 ER - TY - JOUR AU - Palla, G. AU - Derényi, I. AU - Farkas, I. PY - 2005 DA - 2005// TI - T T.V. Uncovering the overlapping community structure of complex networks in nature and society JO - Nature. VL - 435 UR - https://doi.org/10.1038/nature03607 DO - 10.1038/nature03607 ID - Palla2005 ER - TY - JOUR AU - Mettrop, W. AU - Nieuwenhuysen, P. PY - 2001 DA - 2001// TI - Internet search engines—fluctuations in document accessibility JO - J Doc VL - 57 UR - https://doi.org/10.1108/EUM0000000007096 DO - 10.1108/EUM0000000007096 ID - Mettrop2001 ER - TY - STD TI - Liu Y, Scheuermann P, Li X, Zhu X. Using wordnet to disambiguate word senses for text classification. In: international conference on computational science. 2007. ID - ref13 ER - TY - STD TI - Sussna M. Word sense disambiguation for free-text indexing using a massive semantic network. In: Proceedings of the second international conference on information and knowledge management (CIKM). 1993. ID - ref14 ER - TY - STD TI - Wiemer-Hastings P, Wiemer-Hastings K, Graesser A. Latent semantic analysis. Proceedings of the 16th international joint conference on artificial intelligence. 2004. p. 1–14. ID - ref15 ER - TY - STD TI - Joshi AC, Padghan VR, Vyawahare JR, Saner SP. Enforcing document clustering for forensic analysis using weighted matrix method (wmm). 2015. ID - ref16 ER - TY - STD TI - Jain AK. Data clustering: 50 years beyond k-means. Pattern Recognit Lett. 2010;31(8):651–66. Award winning papers from the 19th international conference on pattern recognition (ICPR). ID - ref17 ER - TY - STD TI - Stein B, Eissen SMZ. Topic identification: framework and application. In: Proc of international conference on knowledge management (I-KNOW). 2004. ID - ref18 ER - TY - STD TI - Sahami M. Using machine learning to improve information access. Technical report, Stanford University; 1998. ID - ref19 ER - TY - STD TI - Jayabharathy J, Kanmani S, Parveen AA. Document clustering and topic discovery based on semantic similarity in scientific literature. In: Communication software and networks (ICCSN), 2011 IEEE 3rd international conference on. 2011. p. 425–9. ID - ref20 ER - TY - STD TI - Ayad H, Kamel MS. Topic discovery from text using aggregation of different clustering methods. Proceedings of the 15th conference of the Canadian society for computational studies of intelligence on advances in artificial intelligence., AI 02London, UK, UK: Springer; 2002. p. 161–75. ID - ref21 ER - TY - STD TI - Hromic H, Prangnawarat N, Hulpuş I, Karnstedt M, Hayes C. Graph-based methods for clustering topics of interest in twitter. In: Engineering the web in the big data era. Lecture notes in computer science, vol 9114. Springer; 2015. p. 701–4. ID - ref22 ER - TY - STD TI - Wartena C, Brussee R. Topic detection by clustering keywords. In: Database and expert systems application, 2008. DEXA 08. 19th international workshop on. 2008. p. 54–8. ID - ref23 ER - TY - STD TI - Wong PC, Whitney P, Thomas J. Visualizing association rules for text mining. 1999. ID - ref24 ER - TY - JOUR AU - Deerwester, S. AU - Dumais, S. T. AU - Furnas, G. W. AU - Landauer, T. K. AU - Harshman, R. PY - 1990 DA - 1990// TI - Indexing by latent semantic analysis JO - J Am Soc Inf Sci VL - 41 UR - https://doi.org/3.0.CO;2-9 DO - 3.0.CO;2-9 ID - Deerwester1990 ER - TY - STD TI - Dumais ST, Furnas GW, Landauer TK, Deerwester S, Harshman R. Using latent semantic analysis to improve access to textual information. Proceedings of the SIGCHI conference on human factors in computing systems., CHI 88 New York, NY, USA: ACM; 1988. p. 281–5. ID - ref26 ER - TY - JOUR AU - Landauer, T. K. AU - Foltz, P. W. AU - Laham, D. PY - 1998 DA - 1998// TI - An introduction to latent semantic analysis JO - Discourse Process VL - 25 UR - https://doi.org/10.1080/01638539809545028 DO - 10.1080/01638539809545028 ID - Landauer1998 ER - TY - JOUR AU - Steyvers, M. AU - Griffiths, T. PY - 2007 DA - 2007// TI - Probabilistic topic models JO - Handb Latent Sem Anal VL - 427 ID - Steyvers2007 ER - TY - STD TI - Hofmann T. Probabilistic latent semantic indexing. Proceedings of the 22nd annual international ACM SIGIR conference on research and development in information retrieval, SIGIR 99New York, NY, USA: ACM; 1999. p. 50–7. ID - ref29 ER - TY - STD TI - Yano T, Cohen WW, Smith NA. Predicting response to political blog posts with topic models. Proceedings of human language technologies: the 2009 annual conference of the North American chapter of the association for computational linguistics, NAACL 09Stroudsburg, PA, USA: Association for Computational Linguistics; 2009. p. 477–85. ID - ref30 ER - TY - JOUR AU - Blei, D. M. AU - Ng, A. Y. AU - Jordan, M. I. PY - 2003 DA - 2003// TI - Latent dirichlet allocation JO - J Mach Learn Res VL - 3 ID - Blei2003 ER - TY - STD TI - Zhu D, Fukazawa Y, Karapetsas E, Ota J. Intuitive topic discovery by incorporating word-pair connection into lda. Proceedings of the the 2012 IEEE/WIC/ACM international joint conferences on web intelligence and intelligent agent technology-, vol 01. WI-IAT 12Washington, DC, USA: IEEE Computer Society; 2012. p. 303–10. ID - ref32 ER - TY - STD TI - Hurtado J, Taweewitchakreeya N, Zhu X. Who wrote this paper? learning for authorship de-identification using stylometric featuress. In: Information reuse and integration (IRI), 2014 IEEE 15th international conference on. 2014. p. 859–62. ID - ref33 ER - TY - STD TI - Loper E, Bird S. Nltk: the natural language toolkit. Proceedings of the ACL-02 workshop on effective tools and methodologies for teaching natural language processing and computational linguistics-, vol 1. ETMTNLP 02Stroudsburg, PA, USA: Association for Computational Linguistics; 2002. p. 63–70. ID - ref34 ER - TY - STD TI - Toutanova K, Manning CD. Enriching the knowledge sources used in a maximum entropy part-of-speech tagger. In: Proceedings of the joint SIGDAT conference on empirical methods in natural language processing and very large corpora. 2008. ID - ref35 ER - TY - BOOK AU - Witten, I. H. AU - Frank, E. AU - Hall, M. A. PY - 2011 DA - 2011// TI - Data mining: practical machine learning tools and techniques PB - Morgan Kaufmann Publishers Inc. CY - San Francisco, CA, USA ID - Witten2011 ER - TY - STD TI - Ravaee H, Masoudi-Nejad A, Omidi S, Moeini A. Improved immune genetic algorithm for clustering protein-protein interaction network. In: BioInformatics and bioEngineering (BIBE), 2010 IEEE international conference on. 2010. p. 174–9. ID - ref37 ER - TY - STD TI - Community P. Using the weka forecasting plugin. In: Pentaho BI suite community edition. 2011. http://wiki.pentaho.com/display/DATAMINING/Using+the+Weka+Forecasting+Plugin UR - http://wiki.pentaho.com/display/DATAMINING/Using+the+Weka+Forecasting+Plugin ID - ref38 ER - TY - STD TI - Amar Krishnay JZ, Krishnan S. Polarity trend analysis of public sentiment on youtube. In: The 19th international conference on management of data (COMAD). 2013. ID - ref39 ER - TY - STD TI - Tang J, Zhang J, Yao L, Li J, Zhang L, Su Z. Arnetminer: extraction and mining of academic social networks. Proceedings of the 14th ACM SIGKDD international conference on knowledge discovery and data mining, KDD 08 New York, NY, USA: ACM; 2008. p. 990–8. ID - ref40 ER -