Extending MPEG-7 for efficient annotation of complex web 3D scenes

Patti Spala^1,2,
Athanasios G. Malamos²,
Anastasios Doulamis³ &
…
George Mamakis^1,2

263 Accesses
12 Citations
2 Altmetric
5 Mentions
Explore all metrics

Abstract

In this paper, we propose an annotation scheme for web-3D scenes based on the MPEG-7 standard. We focus on the annotation of 3D scenes that are encoded with the X3D modeling language which is the descendant of VRML. X3D has been adopted by the web service industry as the appropriate framework for developing internet friendly and flexible 3D visualization applications. We introduce MPEG-7 extensions that are necessary in order to fulfill the requirements of the X3D scene structure and we adapt the MPEG-7 schema encoding accordingly. In the annotation scheme, we consider animation and interactivity issues along with geometrical and appearance characteristics of the 3D content providing a more efficient description of the scene. Thus, the extensions proposed in this paper cover all the information required for a complete and efficient description on the position and relative size of 3D objects, specific characteristics such as object type, curvature properties and available textures, combined with the objects’ innate animation properties and its interactions with other objects in the scene or with the end user. The extensions are MPEG-7 Visual and Metadata Descriptors, which fully conform to the standardization restrictions, and we also provide the modifications to the corresponding schema of the ISO 15938 standard that are essential for validating against the proposed MPEG-7 implementation.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price includes VAT (United Kingdom)

Instant access to the full article PDF.

Institutional subscriptions

Multi-platform Semantic Representation of Interactive 3D Content

PhotoSketch: a photocentric urban 3D modeling system

Article 07 April 2017

Sprite tree: an efficient image-based representation for networked virtual environments

Article 11 July 2016

References

Attene M, Robbiano F, Spagnuolo M, Falcidieno B (2007) “Semantic annotation of 3D surface meshes based on feature characterization”, semantic multimedia, lecture notes in computer science, vol. 4816. Springer, Berlin, pp 126–139. doi:10.1007/978-3-540-77051-0_15
Google Scholar
Bilasco IM, Genzel J, Villanova M, Martin H (2005) “On indexing of 3D scenes using MPEG-7”. In: Proceedings of the 13th annual ACM international conference on multimedia (MULTIMEDIA ’05), Singapore, pp. 471–474, ACM, New York, doi:10.1145/1101149.1101254
Bilasco IM, Genzel J, Villanova MO, Martin H (2006) “An MPEG-7 framework enhancing the reuse of 3D models”. In: Proceedings of the eleventh international conference on 3D web technology (Web3D ’06), Columbia, Maryland, ACM, New York, pp. 65–74, doi:10.1145/1122591.1122601
Bilasco IM, Genzel J, Villanova MO, Martin H (2007) “Semantic-based rules for 3D scene adaptation”. In: Proceedings of the twelfth international conference on 3D web technology (Web3D ’07), Perugia, Italy, pp. 97–100, ACM, New York. doi:10.1145/1229390.1229406
Chmielewski J (2008) “Interaction descriptor for 3D objects”. In: Proceedings of the international conference on human system interaction (HSI’08), Krakow, Poland, pp.18–23, May 25–27, doi:10.1109/HSI.2008.4581401
Chmielewski J (2008) “Interaction interfaces for unrestricted multimedia interaction descriptions”. In: Proceedings of the 6th international conference on advances in mobile computing and multimedia (MoMM ’08), ACM, New York, pp. 397–400, doi:10.1145/1497185.1497270
COLLADA Homepage. http://www.khronos.org/collada/
Dasiopoulou S, Tzouvaras V, Kompatsiaris I, Strintzis MG (2010) Enquiring MPEG-7 based multimedia ontologies. Multimed Tools Appl 46(2–3):331–370
Article Google Scholar
Doller M, Kosch H (2008) The MPEG-7 multimedia database system (MPEG-7 MMDB). J Syst Softw 81(9):1559–1580
Article Google Scholar
Doulamis N, Ceacero C, Collantes L, Tektonidis D (2006) “DESYME: development system for mobile services”, 15th IST mobile & wireless communications summit, 4–8 June, Mykonos, Greece
FOCUS K3D Homepage, http://www.focusk3d.eu/
Glantz A, Krutz A, Sikora T, Nunes P, Pereira F (2010) Automatic MPEG-4 sprite coding—comparison of integrated object segmentation algorithms. J Multimed Tools Appl 49(3):483–512
Article Google Scholar
Grana C, Cucchiara R (2006) “Performance of the MPEG-7 shape spectrum descriptor for 3D objects retrieval”. In: Second Italian research conference on digital library management systems, IRCDL, Italy
Halabala P (2003) “Semantic metadata creation.” In: Proceedings of CESCG 2003: 7th central European seminar on computer graphics, pp. 15–25
Halkos D, Doulamis N, Doulamis A (2009) A secure framework exploiting content guided and automated algorithms for real time video searching. Multimed Tools Appl 42:343–375
Article Google Scholar
Hejazi MR, Ho Y-S (2007) An efficient approach to texture-based image retrieval. Int J Imaging Syst Technol 17(5):295–302. doi:10.1002/ima.20120, Wiley Subscription Services
Article Google Scholar
Hejazi MR, Ho Y-S (2007) Efficient approach to extraction of texture browsing descriptor in MPEG-7. Electron Lett 43(13):709–711. doi:10.1049/el:20070208
Article Google Scholar
ISO 15938:3 Multimedia content description interface—part 3: visual. May, 2002
ISO 15938:5 multimedia content description interface—part 5: multimedia description schemes. May, 2003
ISO/IEC 14496-2:2001 coding of audio-video objects—part 2: visual. December, 2001
Kapetanakis K, Spala P, Sympa P, Mamakis G, Malamos AG (2010) “A novel approach in converting SVG architectural data to X3D worlds.” In: Proceedings of the international conference on telecommunications and multimedia, TEMU 2010, Chania, Greece, July 14-16
Koller D, Frischer B, Humphreys G (2009) “Research challenges for digital archives of 3D cultural heritage models,” journal on computing and cultural heritage (JOCCH), vol. 2, issue. 3, article 7. ACM, New York. doi:10.1145/1658346.1658347
Google Scholar
Lee K-L, Chen L-H (2005) An efficient computation method for the texture browsing descriptor of MPEG-7. Image Vis Comput 23(5):479–489. doi:10.1016/j.imavis.2004.12.002
Article Google Scholar
Loewenstein Y, Raimondo D, Redfern OC, Watson J, Frishman D, Linial M, Orengo C, Thornton J, Tramontano A (2009) Protein function annotation by homology-based inference. Genome Biol 10(2):207. doi:10.1186/gb-2009-10-2-207
Article Google Scholar
Malamos AG, Mamakis G, Sympa P, Tsirakis M, Piperidis G, Karakechagias J, Mavraganis K, Kaliakatsos Y (2006) “VCLASS-3D: a multimedia educational collaboration platform with 3D virtual workspace support”. In: Proceedings of the 5th IASTED international conference on web-based education (WBE’06), Puerto Vallarta, pp. 19–24, ACTA Press, Anaheim, CA, USA, 23–25 January
Malamos AG, Mamakis G et al. (2009) “Extending X3D-based educational platform for mathematics with multicast networking capabilities”, WBE ’09, 644–038, Phuket, Thailand, March 16–12
Mikolajczyk K, Zisserman A, Schmid C (2003) “Shape recognition with edge-based features”. In: Proceedings of the 13th British machine vision conference (BMVC’03), pp. 779–788, Norwich, U.K
Min P, Kazhdan M, Funkhouser T (2004) “A comparison of text and shape matching for retrieval of online 3D models”, research and advanced technology for digital libraries, lecture notes in computer science, vol. 3232. Springer, Berlin/Heidelberg, pp 209–220. doi:10.1007/978-3-540-30230-8_20
Google Scholar
MPEG-7 Homepage. http://mpeg.chiariglione.org/standards/mpeg-7/mpeg-7.htm
MPEG Homepage. http://mpeg.chiariglione.org/
Panagiotakis C, Doulamis A, Tziritas G (2009) Equivalent key frames selection based on iso-content principle. IEEE Trans Circuits Syst Video Technol 19(3):447–451. doi:10.1109/TCSVT.2009.2013517, Circuits and Systems for Video Technology
Article Google Scholar
Papaleo L, Floriani L (2009) “Semantic-based segmentation and annotation of 3D models.” In: Proceedings of the 15th international conference on image analysis and processing (ICIAP ’09), Springer-Verlag, Berlin/Heidelberg, pp. 103–112, doi:10.1007/978-3-642-04146-4_13
Pavlopoulos GA, Wegener A-L, Schneider R (2008) A survey of visualization tools for biological network analysis. BioData Min 1:12. doi:10.1186/1756-0381-1-12
Article Google Scholar
Pein RP, Amador M, Lu J, Wolfgang R (2008) “Using CBIR and semantics in 3D-model retrieval”, 8th IEEE international conference on computer and information technology, CIT 2008, pp. 173–178, 8–11, doi:10.1109/CIT.2008.4594669
Pitarello F, de Faveri A “Semantic description of 3D environments: a proposal based on web standards.” In: Proceedings of the eleventh international conference on 3D web technology (Web3D ’06), Columbia, Maryland, pp. 85–95, ACM, New York, 2006, doi:10.1145/1122591.1122603
Ro YM, Kim M, Kang HK, Manjunath BS, Kim J (2001) MPEG-7 homogeneous texture descriptor. ETRI J 23(2):41–51. doi:10.4218/etrij.01.0101.0201
Article Google Scholar
Shen Y, Ong SK, Nee AYC (2008) Product information visualization and augmentation in collaborative design. Comput-Aided Des 40(9):963–974. doi:10.1016/j.cad.2008.07.003
Article Google Scholar
Sikora T (2001) The MPEG-7 visual standard for content description—an overview. IEEE Trans Circuits Syst Video Technol 11(6):696–702. doi:10.1109/76.927422
Article MathSciNet Google Scholar
Sylaiou S, Liarokapis F, Kotsakis K, Patias P (2009) Virtual museums, a survey and some issues for consideration. J Cult Herit 10(4):520–528. doi:10.1016/j.culher.2009.03.003
Article Google Scholar
Tangelder JWH, Veltkamp RC (2008) A survey of content based 3D shape retrieval methods. Multimed Tools Appl 39(3):441–471
Article Google Scholar
Walczak K (2008) “Flex-VR: configurable 3D web applications”. In: Proceedings of the 2008 international conference on human system interactions (HSI’08), Krakow, Poland, pp. 135–140, May 25–27, doi:10.1109/HSI.2008.4581455
WEB3D CONSORTIUM, Extensible 3D (X3D) ISO/IEC 19775:2004. http://www.web3d.org/x3d/specifications/ISOIEC-19775-X3DAbstractSpecification/, 2004
Yang N-C, Chang W-H, Kuo C-M, Li T-H (2008) A fast MPEG-7 dominant color extraction with new similarity measure for image retrieval. J Vis Commun Image Represent 19(2):92–105. doi:10.1016/j.jvcir.2007.05.003
Article Google Scholar
Zaharia T, Preteux F (2001) “3D Shape-based retrieval within the MPEG-7 framework”. In: Proceedings of the SPIE/EI conference on nonlinear image processing, SPIE/EI 2001 Zaharia T, Prêteux F
Zhang L, Ma J, Xu X, Yuan B (2007) “Rotation invariant image classification based on MPEG-7 homogeneous texture descriptor”, eighth ACIS international conference on software engineering, artificial intelligence, networking, and parallel/distributed computing, 2007(SNPD 2007), 3: 798–803, July 30–Aug. 1, doi:10.1109/SNPD.2007.302
Zhou NN, Deng Y-L (2009) “Virtual reality: a state-of-the-art survey”, international journal of automation and computing, institute of automation, Chinese academy of sciences, co-published with Springer-Verlag GmbH, Vol. 6, issue 4, pp. 319–325, doi:10.1007/s11633-009-0319-9

Download references

Author information

Authors and Affiliations

Faculty of Advanced Technology, University of Glamorgan, Treforest, Wales, UK
Patti Spala & George Mamakis
Department of Applied Informatics and Multimedia, Technological Educational Institute of Crete, Crete, Greece
Patti Spala, Athanasios G. Malamos & George Mamakis
Decision Support Lab. Technical University of Crete, Chania, Greece
Anastasios Doulamis

Authors

Patti Spala
View author publications
You can also search for this author in PubMed Google Scholar
Athanasios G. Malamos
View author publications
You can also search for this author in PubMed Google Scholar
Anastasios Doulamis
View author publications
You can also search for this author in PubMed Google Scholar
George Mamakis
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Athanasios G. Malamos.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Spala, P., Malamos, A.G., Doulamis, A. et al. Extending MPEG-7 for efficient annotation of complex web 3D scenes. Multimed Tools Appl 59, 463–504 (2012). https://doi.org/10.1007/s11042-011-0790-5

Download citation

Published: 08 April 2011
Issue Date: July 2012
DOI: https://doi.org/10.1007/s11042-011-0790-5

Extending MPEG-7 for efficient annotation of complex web 3D scenes

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Multi-platform Semantic Representation of Interactive 3D Content

PhotoSketch: a photocentric urban 3D modeling system

Sprite tree: an efficient image-based representation for networked virtual environments

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Extending MPEG-7 for efficient annotation of complex web 3D scenes

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Multi-platform Semantic Representation of Interactive 3D Content

PhotoSketch: a photocentric urban 3D modeling system

Sprite tree: an efficient image-based representation for networked virtual environments

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation