Abstract
In this paper, we propose an annotation scheme for web 3D scenes based on the MPEG-7 standard. We focus on the annotation of 3D scenes encoded in the X3D modeling language, the successor of VRML. X3D has been adopted by the web service industry as a suitable framework for developing internet-friendly and flexible 3D visualization applications. We introduce the MPEG-7 extensions that are necessary to meet the requirements of the X3D scene structure, and we adapt the MPEG-7 schema encoding accordingly. In the annotation scheme, we consider animation and interactivity along with the geometrical and appearance characteristics of the 3D content, which provides a more efficient description of the scene. Thus, the extensions proposed in this paper cover all the information required for a complete and efficient description of the position and relative size of 3D objects and of specific characteristics such as object type, curvature properties and available textures, combined with the objects’ innate animation properties and their interactions with other objects in the scene or with the end user. The extensions are MPEG-7 Visual and Metadata Descriptors that fully conform to the standardization restrictions. We also provide the modifications to the corresponding schema of the ISO 15938 standard that are essential for validating against the proposed MPEG-7 implementation.
Cite this article
Spala, P., Malamos, A.G., Doulamis, A. et al. Extending MPEG-7 for efficient annotation of complex web 3D scenes. Multimed Tools Appl 59, 463–504 (2012). https://doi.org/10.1007/s11042-011-0790-5
DOI: https://doi.org/10.1007/s11042-011-0790-5