[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3297662.3365822acmotherconferencesArticle/Chapter ViewAbstractPublication PagesmedesConference Proceedingsconference-collections
research-article

Applications of Generalized Difference Method for Hypothesis Generation to Social Big Data in Concept and Real Spaces

Published: 10 January 2020 Publication History

Abstract

Analytic methodology as to generation of integrated hypotheses is necessary for applications involving different sources of social big data. In this paper, first, we introduce an abstract data model for integrating data management and data mining by using mathematical concepts of families, collections of sets to facilitate reproducibility and accountability required for social big data applications. Next, we describe generalized difference methods as a methodology for generating integrated hypotheses. Finally, we validate our proposal by applying them to three use cases involving data in concept and real spaces by using our data model as their description guided by generalized difference methods.

References

[1]
A. Abdulle and G. Wanner (2002), 200 years of least squares method, Math. Vol. 57, No. 2. pp. 45--60, Springer. https://doi.org/10.1007/PL00000559
[2]
P.J.Brockwell and R.A.Davis (2016), Introduction to Time Series and Forecasting, Springer.
[3]
Cookpad https://cookpad.com/ Accessed 2019
[4]
Masaki Endo, Masaharu Hirota, Shigeyoshi Ohno, and Hiroshi Ishikawa (2018), Best-time Estimation Method using Information Interpolation for Sightseeing Spots, Proc. International Journal of Informatics Society (IJIS) Vol. 10, No 2, pp. 97--105.
[5]
Erich Gamma, Richard Helm, Ralph Johnson, and John Vlissides (1994), Design Patterns: Elements of Reusable Object-Oriented Software, Pearson Education.
[6]
T.H.Hamaza et al. (2011), Genome-wide gene-environment study identifies glutamate receptor gene GRIN2A as a Parkinson's disease modifier gene via interaction with coffee, PLoS Genet. Aug;7(8):e1002237.
[7]
Atsushi Hara, Yukio Yamamoto, Tetsuya Araki, Masaharu Shibata, and Hiroshi Ishikawa (2019), Identification of moon central peak craters by machine learning using Kaguya DEM, Journal of Space Science Informatics Japan: Vol. 8, pp.1--10 (in Japanese)
[8]
Trevor Hastie, Robert Tibshirani, and Jerome Friedman (2009), The Elements of Statistical Learning -Data Mining, Inference, and Prediction, Second Edition, Springer.
[9]
Paul W. Holland, Statistics and Causal Inference (1986), Journal of the American Statistical Association, Vol. 81, No. 396, pp. 945--960.
[10]
IDC, The Diverse and Exploding Digital Universe (white paper, 2008). http://www.emc.com/collateral/analyst-reports/diverse-exploding-digital-universe.pdf Accessed November 2017.
[11]
IDC, The Digital Universe In 2020: Big Data, Bigger Digital Shadows, and Biggest Growth in the Far East (2012). http://www.emc.com/leadership/digital-universe/iview/index.htm Accessed November 2017
[12]
H. Ishikawa (2015), Social Big Data Mining, CRC Press.
[13]
H. Ishikawa and R. Chbeir (2017), A data model for integrating data management and data mining in social big data, Proceedings of 9th IARIA International Conference on Advances in Multimedia (MMEDIA 2017).
[14]
Hiroshi Ishikawa, Daiju Kato, Endo Masaki, and Masaharu Hirota (2018), Generalized Difference Methods for Generating Integrated Hypotheses in Social Big Data (Invited paper), Proc. The 10th International Conference on Management of Digital EcoSystems (MEDES '18).
[15]
JAXA, KAGUY(SELENE) Data Archive http://darts.isas.jaxa.jp/planet/pdap/selene/index.html. Accessed 2019.
[16]
Felipe Jimenez (2017), Intelligent Vehicles, Enabling Technologies and Future Developments, Butterworth-Heinemann.
[17]
Anna Jurek et al (2014), A survey of commonly used ensemble-based classification techniques, The Knowledge Engineering Review, vol 25, no.5, pp. 551--581.
[18]
M. Kanno, Y. Ehara, M. Hirota, S. Yokoyama, and H. Ishikawa (2016). Visualizing High-Risk paths using Geo-tagged Social Data for Disaster Mitigation, Proc. 9th ACM SIGSPATIAL International Workshop on Location-Based Social Networks (LBSN '16).
[19]
Kodai Kato, Ryuhei Yamada, Yukio Yamamoto, Masaharu Shibata, Shohei Yokoyama, and Hiroshi Ishikawa (2018), Investigation of Orbit Parameters to Classify the Deep Moonquake Source, Journal of Space Science Informatics Japan, Vol. 7, pp. 43--52 (in Japanese).
[20]
MEDES Welcome to the Website of the International Conference on ManagEment of Digital EcoSystems (MEDES) http://medes.sigappfr.org/ Accessed 2018
[21]
Ronald E. Mickens (2018), Difference Equations, CRC Press.
[22]
Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean (2019), Efficient Estimation of Word Representation in Vector Space arXiv: 1301.3781 Accessed 2019
[23]
K. Mitomi, M. Endo, M. Hirota, S. Yokoyama, Y. Shoji, and H. Ishikawa (2016), How to Find Accessible Free Wi-Fi at Tourist Spots in Japan, Volume 10046 of Lecture Notes in Computer Science, pp. 389--403.
[24]
NIKKEI Asian Review, Japan takes step toward enormous bank of personal data, 2017. https://asia.nikkei.com/Economy/Japan-takes-step-toward-enormous-bank-of-personal-data
[25]
Kensuke Nobumoto, Daiju Kato, Masaki Endo, Masaharu Endo, Masaharu Hirota, and Hiroshi Ishikawa (2017), Multilingualization of Restaurant Menu by Analogical Description, Proc. 9th Workshop on Multimedia for Cooking and Eating Activities (CEA 2017).
[26]
Kensuke Nobumoto, Masaharu Hirota, Daiju Kato, and Hiroshi Ishikawa (2019), Multilingualization of Cooking by Analogical Description, Journal of Japan Society for Fuzzy Theory and Intelligent Informatics, vol.31, no.1, pp526--553 (in Japanese).
[27]
OECD, Evidence in Education: Linking Research and Policy http://www.oecd.org/education/ceri/38797034.pdf Accessed August 2018.
[28]
Ekaterina Olshannikova, Thomas Olsson, Jukka Huhtamäki, and Hannu Kärkkäinen (2017), Conceptualizing big social data. Journal of Big Data, Vol.4, No.1, pp.1--19. https://doi.org/10.1186/s40537-017-0063-x Accessed 2018
[29]
R. D. Peng (2011), Reproducible Research in Computational Science, SCIENCE, VOL 334, 2.
[30]
Prime minister of Japan and his cabinet, Meeting of the Council for the Development of a Tourism Vision to Support the Future of Japan https://japan.kantei.go.jp/97_abe/actions/201603/30article1.html
[31]
G. Salton et al. (1975), A vector space model for automatic indexing, CACM, vol. 18, no.11.
[32]
Nishu Singla, Motion Detection Based on Frame Difference Method (2014), International Journal of Information & Computation Technology, Volume 4, Number 15, pp. 1559--1565.
[33]
D. Smith, R. St. Andre, and M. Eggen (2014), A Transition to Advanced Mathematics, Brooks/Cole Pub Co.
[34]
G. Strang (2019), Linear Algebra and Learning from Data, Wellesley-Cambridge Press.
[35]
E.M. Sweeney et al. (2013), Automatic Lesion Incidence Estimation and Detection in Multiple Sclerosis Using Multisequence Longitudinal MRI, AJNR Am J Neuroradiol. Jan; 34(1): 68--73.
[36]
Tourist culture, tide of destination management No. 234 (July 2017). https://www.jtb.or.jp/w-pcontent/uploads/2017/07/bunka234.pdf Accessed 2019
[37]
Miho Toyoshima, et al. (2017), Discovery of disappointed spots by using tweets posted by foreign visitors to Japan, Proc. the 14th National Meeting of the Society for Tourism Informatics (in Japanese).
[38]
Takahito Tsuchida, Masaki Endo, Daiju Kato, Yo Ehara, Masaharu Hirota, Shohei Yokoyama, and Hiroshi Ishikawa (2016), On calculation of the meanings of the areas and landmarks using Word2Vec, Proc. 8th Forum on Data Engineering and Information Management (DEIM), (in Japanese).
[39]
Sandro Vega-Pons and Jose Ruiz-Shulcloper (2011), A Survey of Clustering Ensemble Algorithms, International Journal of Pattern Recognition and Artificial Intelligence, Vol. 25, No. 3337--372.
[40]
G. Wu, D. Shen, and M.R. Sabuncu (2016), Machine learning and medical imaging, Academic Press.
[41]
Satoru Yamamoto et al. (2018), An Automated Method for Crater Counting from Digital Terrain Model Using Rotational Pixel Swapping Method, Proc. 49th Lunar and Planetary Science Conference.
[42]
Yukio Yamamoto and Hiroshi Ishikawa (2019), Anomaly Detection with Hotelling T-square Method for Raw Housekeeping Telemetry, Proc. International Symposium on Space Technology and Science.

Index Terms

  1. Applications of Generalized Difference Method for Hypothesis Generation to Social Big Data in Concept and Real Spaces

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    MEDES '19: Proceedings of the 11th International Conference on Management of Digital EcoSystems
    November 2019
    350 pages
    ISBN:9781450362382
    DOI:10.1145/3297662
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 10 January 2020

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Social big data
    2. data management
    3. data mining
    4. data model
    5. difference method
    6. hypothesis generation
    7. integrated analysis

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    MEDES '19

    Acceptance Rates

    MEDES '19 Paper Acceptance Rate 41 of 102 submissions, 40%;
    Overall Acceptance Rate 267 of 682 submissions, 39%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 55
      Total Downloads
    • Downloads (Last 12 months)4
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 23 Dec 2024

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media