Home   >   CSC-OpenAccess Library   >    Manuscript Information
A Novel Data Mining Algorithm for Semantic Web Based Data Cloud
N.C.Mahanti, kanhaiya lal
Pages - 160 - 175     |    Revised - 30-04-2010     |    Published - 10-06-2010
Volume - 4   Issue - 2    |    Publication Date - May 2010  Table of Contents
data mining, Cloud, Web
By a cloud, we mean an infrastructure that provides resources and/or services over the Internet. A storage cloud provides storage services, while a compute cloud provides compute services. We describe the design of the Sector storage cloud and how it provides the storage services required by the Sphere compute cloud. Different efforts have been made to address the problem of data mining in the cloud framework. In this paper we propose an algorithm to mine the data from the cloud using sector/sphere framework and association rules. We also describe the programming paradigm supported by the Sphere compute cloud. Sector and Sphere are designed for analyzing large data sets using computer clusters connected with wide area high performance networks
CITED BY (6)  
2 Zheng, Y., Huang, Z., & He, T. (2013). Classification based on both attribute value weight and tuple weight under the cloud computing. Mathematical Problems in Engineering, 2013.
3 Li Nan , & Zhang Xuefu . ( 2013 ) found that knowledge-based Application System related data . Library and Information Service , ( 6 ) , 127-133.
4 Rong, C., Quan, Z., & Chakravorty, A. (2013, December). On Access Control Schemes for Hadoop Data Storage. In Cloud Computing and Big Data (CloudCom-Asia), 2013 International Conference on (pp. 641-645). IEEE.
5 Li Nan , Zhang Xuefu & study found that knowledge-based application system related data.
6 Kode, S. R. (2012). A relational interval tree for efficient insertion and searching of transient mobile data.
1 Google Scholar 
2 ScientificCommons 
3 Academic Index 
4 CiteSeerX 
5 refSeek 
6 iSEEK 
7 Socol@r  
8 ResearchGATE 
9 Libsearch 
10 Bielefeld Academic Search Engine (BASE) 
11 Scribd 
12 WorldCat 
13 SlideShare 
15 PdfSR 
A. Rosenthal et al. “Cloud computing: A new business paradigm for Biomedical information sharing “Journal of Biomedical Informatics, Elsevier ,43 (2010) 342–353 343.
Agrawal R. Imielinski T, Swami. “A Database mining: a performance perspective”. IEEE Transactions on Knowledge and Data Engineering, Dec.1993,5(6): 914 - 925.
Amazon Web Services LLC. “Amazon web services developer connection”. retrieved from developer.amazonwebservices.com on November 1, 2007.
Amazon. Amazon Simple Storage Service (Amazon S3). www.amazon.com/s3.
Dhruba Borthaku. “The hadoop distributed file system: Architecture and design”. retrieved fromlucene.apache.org/hadoop, 2007.
G. Stumme et al. “Web Semantics: Science, Services and Agents on the World Wide Web”, Journal of WEB Semantics, Elsevier, 4 (2006) 124–143.
Han J , Kamber M. “Data Mining: Concepts and Techniques”. 2/e San Francisco: CA. Morgan Kaufmann Publishers, an imprint of Elsevier. pp-259-261, 628-640 (2006)
Hillol Kargupta. Proceedings of Next Generation Data Mining2007. Taylor and Francis, 2008.
I. Stoica, R. Morris, D. Karger, M. F. Kaashoek, and H Balakrishnana. “Chord: A scalable peer to peer lookup service for internet applications”. In Proceedings of the ACM SIGCOMM ’01, pages 149–160, 2001.
Ian Foster and Carl Kesselman. “The Grid 2: Blueprint for a New Computing infrastructure”. Morgan Kaufmann, San Francisco, California, 2004.
Jeffrey Dean and Sanjay Ghemawat. “MapReduce: Simplifieddata processing on large clusters”. In OSDI’04: SixthSymposium on Operating System Design andImplementation, 2004.
Jim Gray and Alexander S. Szalay. “The world-wide telescope”. Science, vol 293:2037–2040, 2001.
John P. Hayes ,”Computer Architecture and Organizaton”,3/e, McGraw-HILL INTERNATIONAL EDITIONS, Computer Science Series, pp- 275-292 (1998)
Robert L. Grossman and Yunhong Gu “Data Mining using high performance data clouds: Experimental Studies using Sector and Sphere”. Retrieved from http://sector.sourceforge.net/pub/grossman-gu-ncdm-tr-08-04.pdf.
Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung. “The Google File System”. In SOSP, 2003.
Yunhong Gu and Robert L. Grossman. “UDT: UDP-based data transfer for high-speed wide area networks”. Computer Networks, 51(7):1777—1799, 2007.
Yunhong Gu and Robert L. Grossman. “UDT: UDP-based data transfer for high-speed wide area networks”. Computer Networks, 51(7):1777—1799, 2007.
ZouLi & LiangXu, “Mining Association Rules in Distributed System”, First International Workshop on Education Technology and Computer Science,. IEEE, 2009.
Professor N.C.Mahanti
Birla Institute of Technology - India
Mr. kanhaiya lal
Birla Institute of Technology - India