TW202114409A

TW202114409A - Method and system for setting presentable virtual object for target

Info

Publication number: TW202114409A
Application number: TW109133377A
Authority: TW
Inventors: 李江亮; 牛旭恒; 周磑; 方俊
Original assignee: 大陸商北京外號信息技術有限公司
Priority date: 2019-09-26
Filing date: 2020-09-25
Publication date: 2021-04-01
Also published as: CN112561952B; CN112561952A; WO2021057887A1; TWI750822B

Abstract

A method and system for setting a presentable virtual object for a target, wherein a camera and an optical communication device are installed in the augmented reality environment, and the camera and the optical communication device have a relative pose, the method includes: The camera tracks the target in the augmented reality; obtains the position signal of the target according to the tracking result of the camera; sets a virtual object associated with the target with a spatial position signal, wherein the spatial position signal of the virtual object is based on the location signal of the target is determined; and the related signal of the virtual object is sent to the first device, the signal includes the spatial location signal of the virtual object, wherein the related signal of the virtual object can be used by the first device. The virtual object is presented on the display medium based on the position signal and the posture signal determined by the optical communication device.

Description

Method and system for setting presentable virtual object for target

本發明屬擴增實境技術領域，尤其涉及一種用於為現實場景中的目標設置可呈現的虛擬對象的方法和系統。 The present invention belongs to the field of augmented reality technology, and in particular relates to a method and system for setting a presentable virtual object for a target in a real scene.

本部分的陳述僅僅是為了提供與本發明相關的背景訊號，以幫助理解本發明，這些背景訊號並不一定構成現有技術。 The statements in this section are only to provide background signals related to the present invention to help understand the present invention, and these background signals do not necessarily constitute prior art.

近年來，擴增實境(Augmented Reality，AR)技術取得了長足的進展。擴增實境技術也被稱為混合現實技術，其通過計算機技術將虛擬對象疊加到現實場景中，使得現實場景和虛擬對象能夠實時地呈現到同一個畫面或空間中，從而增強用戶對現實世界的感知。由於擴增實境技術能夠對真實環境進行增強顯示輸出，其在醫療研究與解剖訓練、精密儀器製造和維修、軍用飛機導航、工程設計和遠程機器人控制等技術領域，獲得了越來越廣泛的應用。 In recent years, Augmented Reality (AR) technology has made great progress. Augmented reality technology is also known as mixed reality technology. It superimposes virtual objects into the real scene through computer technology, so that the real scene and virtual objects can be presented in the same screen or space in real time, thereby enhancing the user’s perception of the real world. Perception. Because augmented reality technology can enhance the display output of the real environment, it has gained more and more widespread use in medical research and anatomy training, precision instrument manufacturing and maintenance, military aircraft navigation, engineering design, and remote robot control. application.

在一種擴增實境應用中，可以在視野中的固定位置處疊加一些數據訊號，例如，飛行員在駕駛飛機的時候，可以通過佩戴顯示頭盔來查看在現實場景上疊加的飛行數據，這些數據通常都是顯示在視野中的固定位置處(例如，始終在視野的右上角)，因而缺乏足夠的靈活性，特別是，所疊加的虛擬對象無法隨著現實場景中的真實目標的移動而移動。在另一種擴增實境應用中，可以在用戶手機拍攝到的人物附近疊加一些虛擬對象，但是這種方式準確度不足，並且不適合在多個人之間的AR交互。 In an augmented reality application, some data signals can be superimposed on a fixed position in the field of view. For example, when a pilot is flying an airplane, he can view the flight data superimposed on the real scene by wearing a display helmet. They are all displayed at a fixed position in the field of view (for example, always in the upper right corner of the field of view), and therefore lack sufficient flexibility. In particular, the superimposed virtual object cannot move with the movement of the real target in the real scene. In another augmented reality application, some virtual objects can be superimposed near the person photographed by the user's mobile phone, but this method is not accurate enough and is not suitable for AR interaction between multiple persons.

本發明的方案提供了一種用於為現實場景中的目標設置可呈現的虛擬對象的方法和系統，其通過使用光通訊裝置和攝像頭，使得能夠基於現實場景中的目標的位置訊號為其設置相關聯的虛擬對象，該虛擬對象能夠在設備的顯示媒介上精確地呈現，並且所呈現的虛擬對象能夠跟隨現實場景中的目標。 The solution of the present invention provides a method and system for setting a presentable virtual object for a target in a real scene, which uses an optical communication device and a camera to make It is necessary to set the associated virtual object based on the position signal of the target in the real scene, the virtual object can be accurately presented on the display medium of the device, and the presented virtual object can follow the target in the real scene.

本發明的一個方面涉及一種用於為現實場景中的目標設置可呈現的虛擬對象的方法，其中，所述現實場景中安裝有攝像頭和光通訊裝置，所述攝像頭和所述光通訊裝置之間具有相對位姿，所述方法包括：使用所述攝像頭跟蹤所述現實場景中的目標；根據所述攝像頭的跟蹤結果獲得所述目標的位置訊號；設置與所述目標相關聯的具有空間位置訊號的虛擬對象，其中，所述虛擬對象的空間位置訊號基於所述目標的位置訊號而確定；以及將所述虛擬對象的相關訊號發送給第一設備，該訊號中包括所述虛擬對象的空間位置訊號，其中，所述虛擬對象的相關訊號能夠被所述第一設備使用以基於其通過所述光通訊裝置確定的位置訊號和姿態訊號在其顯示媒介上呈現所述虛擬對象。 One aspect of the present invention relates to a method for setting a presentable virtual object for a target in a real scene, wherein a camera and an optical communication device are installed in the real scene, and there is an optical communication device between the camera and the optical communication device. Relative to the pose, the method includes: using the camera to track a target in the real scene; obtaining a position signal of the target according to the tracking result of the camera; setting a spatial position signal associated with the target A virtual object, wherein the spatial position signal of the virtual object is determined based on the position signal of the target; and a signal related to the virtual object is sent to the first device, and the signal includes the spatial position signal of the virtual object Wherein, the related signal of the virtual object can be used by the first device to present the virtual object on its display medium based on the position signal and the posture signal determined by the optical communication device.

可選地，其中，所述根據所述攝像頭的跟蹤結果獲得所述目標的位置訊號包括：根據所述攝像頭的跟蹤結果獲得所述目標相對於所述攝像頭的位置訊號，以及根據所述目標相對於所述攝像頭的位置訊號以及所述攝像頭和所述光通訊裝置之間的相對位姿來確定所述目標相對於所述光通訊裝置的位置訊號。 Optionally, wherein the obtaining the position signal of the target according to the tracking result of the camera includes: obtaining the position signal of the target relative to the camera according to the tracking result of the camera, and obtaining the position signal of the target relative to the camera according to the tracking result of the camera; The position signal of the camera and the relative pose between the camera and the optical communication device are used to determine the position signal of the target relative to the optical communication device.

可選地，其中，所述虛擬對象的空間位置訊號是相對於所述光通訊裝置的空間位置訊號。 Optionally, wherein the spatial position signal of the virtual object is a spatial position signal relative to the optical communication device.

可選地，其中，所述根據所述攝像頭的跟蹤結果獲得所述目標的位置訊號包括：根據所述攝像頭的跟蹤結果獲得所述目標相對於所述攝像頭的位置訊號，以及根據所述目標相對於所述攝像頭的位置訊號以及所述攝像頭在所述現實場景中的位姿訊號來確定所述目標在所述現實場景中的位置訊號。 Optionally, wherein the obtaining the position signal of the target according to the tracking result of the camera includes: obtaining the position signal of the target relative to the camera according to the tracking result of the camera, and obtaining the position signal of the target relative to the camera according to the tracking result of the camera; The position signal of the camera and the pose signal of the camera in the real scene are used to determine the position signal of the target in the real scene.

可選地，其中，根據與所述目標有關的訊號來配置所述虛擬對象的相關訊號。 Optionally, wherein the related signal of the virtual object is configured according to the signal related to the target.

可選地，所述方法還包括：根據所述攝像頭的跟蹤結果獲得與所述目標有關的訊號。 Optionally, the method further includes: obtaining a signal related to the target according to the tracking result of the camera.

可選地，其中，所述根據與所述目標有關的訊號來配置所述虛擬對象的相關訊號包括：從設施接收與所述目標有關的訊號；將所述訊號與所述攝像頭跟蹤的位於相對於所述設施的預定位置處的目標建立聯繫；以及根據所述訊號配置所述目標的虛擬對象的相關訊號。 Optionally, wherein the configuring the signal related to the virtual object according to the signal related to the target includes: receiving a signal related to the target from a facility; Establishing a connection with a target at a predetermined location of the facility; and configuring the relevant signal of the virtual object of the target according to the signal.

可選地，其中，所述根據與所述目標有關的訊號來配置所述虛擬對象的相關訊號包括：從第二設備接收訊號，該訊號包括所述第二設備的位置訊號；將所述第二設備的位置訊號與根據所述攝像頭的跟蹤結果確定的一個或多個目標的位置訊號進行比較，以確定與所述第二設備匹配的目標；以及根據來自所述第二設備的訊號配置與所述第二設備匹配的目標的虛擬對象的相關訊號。 Optionally, wherein the configuring the related signal of the virtual object according to the signal related to the target includes: receiving a signal from a second device, the signal including a position signal of the second device; The position signal of the second device is compared with the position signal of one or more targets determined according to the tracking result of the camera to determine a target that matches with the second device; and according to the signal configuration from the second device and The related signal of the virtual object of the target matched by the second device.

可選地，其中，所述第二設備至少部分地通過採集包括所述光通訊裝置的圖像並分析該圖像來確定其位置訊號。 Optionally, wherein the second device determines its position signal at least in part by collecting an image including the optical communication device and analyzing the image.

可選地，所述方法還包括：根據所述攝像頭的跟蹤結果獲得所述目標的姿態訊號；以及其中，所述設置與所述目標相關聯的具有空間位置訊號的虛擬對象包括根據所述目標的姿態訊號設置所述虛擬對象的姿態訊號。 Optionally, the method further includes: obtaining a posture signal of the target according to the tracking result of the camera; and wherein the setting a virtual object associated with the target with a spatial position signal includes The gesture signal of sets the gesture signal of the virtual object.

可選地，其中，所述虛擬對象的姿態能夠根據所述第一設備相對於所述虛擬對象的位置和/或姿態變化而調整。 Optionally, wherein the posture of the virtual object can be adjusted according to a change in the position and/or posture of the first device relative to the virtual object.

可選地，其中，所述第一設備至少部分地通過採集包括所述光通訊裝置的圖像並分析該圖像來確定其相對於所述光通訊裝置的位置訊號和姿態訊號。 Optionally, wherein the first device at least partially collects an image including the optical communication device and analyzes the image to determine its position signal and attitude signal relative to the optical communication device.

可選地，其中，根據與所述第一設備有關的訊號確定要發送給所述第一設備的所述虛擬對象的相關訊號。 Optionally, wherein the signal related to the virtual object to be sent to the first device is determined according to the signal related to the first device.

本發明的另一個方面涉及一種用於為現實場景中的目標設置可呈現的虛擬對象的系統，包括：安裝於所述現實場景中的攝像頭，其用於跟蹤所述現實場景中的目標；安裝於所述現實場景中的光通訊裝置，其中，所述光通訊裝置和所述攝像頭具有相對位姿；以及伺服器，其用於執行上述方法。 Another aspect of the present invention relates to a system for setting a presentable virtual object for a target in a real scene, including: a camera installed in the real scene for tracking the target in the real scene; installation The optical communication device in the real scene, wherein the optical communication device and the camera have relative poses; and a server for executing the above method.

可選地，所述系統還包括能夠獲得與所述目標有關的訊號的設施。 Optionally, the system further includes information that can obtain information related to the target. No. of facilities.

可選地，所述系統還包括所述第一設備，其用於：從所述伺服器接收所述虛擬對象的相關訊號；至少部分地通過所述光通訊裝置確定所述第一設備的位置訊號和姿態訊號；以及基於所述位置訊號和姿態訊號以及所述虛擬對象的相關訊號在所述第一設備的顯示媒介上呈現所述虛擬對象。 Optionally, the system further includes the first device, configured to: receive a signal related to the virtual object from the server; determine the location of the first device at least in part through the optical communication device A signal and a posture signal; and based on the position signal, the posture signal and the related signal of the virtual object, the virtual object is presented on the display medium of the first device.

本發明的另一個方面涉及一種存儲介質，其中存儲有計算機程式，在所述計算機程式被處理器執行時，能夠用於實現上述方法。 Another aspect of the present invention relates to a storage medium in which a computer program is stored, and when the computer program is executed by a processor, it can be used to implement the above method.

本發明的另一個方面涉及一種電子設備，包括處理器和存儲器，所述存儲器中存儲有計算機程式，在所述計算機程式被處理器執行時，能夠用於實現上述方法。 Another aspect of the present invention relates to an electronic device, including a processor and a memory, and a computer program is stored in the memory. When the computer program is executed by the processor, the computer program can be used to implement the above method.

100:光標籤 100: light label

101:第一光源 101: The first light source

102:第二光源 102: second light source

103:第三光源 103: third light source

S301~S304:步驟流程 S301~S304: Step flow

圖1示出了一種示例性的光標籤。 Figure 1 shows an exemplary optical label.

圖2示出了根據一個實施例的包含用於為目標設置可呈現的虛擬對象的系統的現實場景。 Fig. 2 shows a real-world scene including a system for setting a presentable virtual object for a target according to one embodiment.

圖3示出了根據一個實施例的用於為目標設置可呈現的虛擬對象的方法。 Fig. 3 shows a method for setting a presentable virtual object for a target according to an embodiment.

圖4示出了根據一個實施例的由圖2中的第一用戶通過手機螢幕或者AR眼鏡觀察到的示意圖像。 Fig. 4 shows a schematic image observed by the first user in Fig. 2 through a mobile phone screen or AR glasses according to an embodiment.

為了使本發明的目的、技術方案及優點更加清楚明白，以下結合附圖通過具體實施例對本發明進一步詳細說明。應當理解，此處所描述的具體實施例僅僅用以解釋本發明，並不用於限定本發明。 In order to make the objectives, technical solutions, and advantages of the present invention clearer, the following further describes the present invention in detail through specific embodiments with reference to the accompanying drawings. It should be understood that the specific embodiments described here are only used to explain the present invention, but not to limit the present invention.

光通訊裝置也稱為光標籤，這兩個術語在本文中可以互換使用。光標籤能夠通過不同的發光方式來傳遞訊號，其具有識別距離遠、可見光條件要求寬鬆的優勢，並且光標籤所傳遞的訊號可以隨時間變化，從而可以提供大的訊號容量和靈活的配置能力。相比於傳統的二維條碼，光標籤具有更遠的識別距離和更強的訊號交互能力，從而可以為用戶提供巨大的便利性。 Optical communication devices are also called optical tags, and these two terms can be used interchangeably in this article. Optical tags can transmit signals through different light-emitting methods, which have the advantages of long recognition distance and relaxed requirements for visible light conditions, and the signals transmitted by optical tags can change over time, which can provide large signal capacity and flexible configuration capabilities. Compared with the traditional two-dimensional bar code, The optical tag has a longer recognition distance and stronger signal interaction capability, which can provide users with great convenience.

光標籤中通常可以包括控制器和至少一個光源，該控制器可以通過不同的驅動模式來驅動光源，以向外傳遞不同的訊號。圖1示出了一種示例性的光標籤100，其包括三個光源(分別是第一光源101、第二光源102、第三光源103)。光標籤100還包括控制器(在圖1中未示出)，其用於根據要傳遞的訊號為每個光源選擇相應的驅動模式。例如，在不同的驅動模式下，控制器可以使用不同的驅動信號來控制光源的發光方式，從而使得當使用具有成像功能的設備拍攝光標籤100時，其中的光源的成像可以呈現出不同的外觀(例如，不同的顏色、圖案、亮度、等等)。通過分析光標籤100中的光源的成像，可以解析出各個光源此刻的驅動模式，從而解析出光標籤100此刻傳遞的訊號。 The optical tag usually includes a controller and at least one light source, and the controller can drive the light source through different driving modes to transmit different signals to the outside. Fig. 1 shows an exemplary optical label 100, which includes three light sources (respectively a first light source 101, a second light source 102, and a third light source 103). The optical label 100 also includes a controller (not shown in FIG. 1), which is used to select a corresponding driving mode for each light source according to the signal to be transmitted. For example, in different driving modes, the controller can use different driving signals to control the light emitting mode of the light source, so that when a device with imaging function is used to photograph the light label 100, the imaging of the light source therein can show a different appearance. (For example, different colors, patterns, brightness, etc.). By analyzing the imaging of the light source in the optical label 100, the driving mode of each light source at the moment can be analyzed, so as to analyze the signal transmitted by the optical label 100 at the moment.

為了基於光標籤向用戶提供相應的服務，每個光標籤可以被分配一個標識訊號(ID)，該標識訊號用於由光標籤的製造者、管理者或使用者等唯一地識別或標識光標籤。通常，可由光標籤中的控制器驅動光源以向外傳遞該標識訊號，而用戶可以使用設備對光標籤進行圖像採集來獲得該光標籤傳遞的標識訊號，從而可以基於該標識訊號來訪問相應的服務，例如，訪問與標識訊號相關聯的網頁、獲取與標識訊號相關聯的其他訊號(例如，與該標識訊號對應的光標籤的位置訊號)等等。本文提到的設備例如可以是用戶攜帶或控制的設備(例如，手機、平板電腦、智能眼鏡、AR眼鏡、智能頭盔、智能手錶等等)，也可以是能夠自主移動的機器(例如，無人機、無人駕駛汽車、機器人等等)。設備可以通過其上的攝像頭對光標籤進行圖像採集來獲得包含光標籤的圖像，並通過分析圖像中的光標籤(或光標籤中的各個光源)的成像以識別出光標籤傳遞的訊號。 In order to provide users with corresponding services based on optical tags, each optical tag can be assigned an identification signal (ID), which is used to uniquely identify or identify the optical tag by the manufacturer, manager, or user of the optical tag . Generally, the light source can be driven by the controller in the optical tag to transmit the identification signal outward, and the user can use the device to collect the image of the optical tag to obtain the identification signal transmitted by the optical tag, so that the corresponding identification signal can be accessed based on the identification signal. Services, such as accessing web pages associated with the identification signal, obtaining other signals associated with the identification signal (for example, the position signal of the optical tag corresponding to the identification signal), and so on. The devices mentioned in this article can be, for example, devices carried or controlled by users (for example, mobile phones, tablets, smart glasses, AR glasses, smart helmets, smart watches, etc.), or machines that can move autonomously (for example, drones). , Driverless cars, robots, etc.). The device can collect the image of the optical label through the camera on it to obtain the image containing the optical label, and identify the signal transmitted by the optical label by analyzing the imaging of the optical label (or each light source in the optical label) in the image .

可以在伺服器上保存每個光標籤的標識訊號(ID)或其他訊號，例如與該光標籤相關的服務訊號、與該光標籤相關的描述訊號或屬性，如光標籤的位置訊號、型號訊號、物理尺寸訊號、物理形狀訊號、姿態或朝向訊號等。光標籤也可以具有統一的或默認的物理尺寸訊號和物理形狀訊號等。設備可以使用識別出的光標籤的標識訊號來從伺服器查詢獲得與該光標籤有關的其他訊號。光標籤的位置訊號可以是指該光標籤在物理世界中的實際位置，其可以通過地理座標訊號來指示。伺服器可以是在計算裝置上運行的軟件程式、一台計算裝置或者由多台計算裝置構成的集群。光標籤可以是離線的，也即，光標籤不需要與伺服器進行通訊。當然，可以理解，能夠與伺服器進行通訊的在線光標籤也是可行的。 The identification signal (ID) or other signals of each optical label can be saved on the server, such as the service signal related to the optical label, the description signal or attribute related to the optical label, such as the position signal and model signal of the optical label , Physical size signal, physical shape signal, attitude or orientation signal, etc. Optical tags can also have a uniform or default physical size signal and physical Shape signal, etc. The device can use the identified identification signal of the optical label to query the server to obtain other signals related to the optical label. The position signal of the optical tag can refer to the actual position of the optical tag in the physical world, which can be indicated by the geographic coordinate signal. The server can be a software program running on a computing device, a computing device, or a cluster composed of multiple computing devices. The optical tag can be offline, that is, the optical tag does not need to communicate with the server. Of course, it can be understood that online optical tags that can communicate with the server are also feasible.

圖2示出了根據一個實施例的包含用於為目標設置可呈現的虛擬對象的系統的現實場景，該現實場景例如可以是一個銀行網點，其中有三個用戶，分別是第一用戶、第二用戶和第三用戶，第一用戶可以是銀行網點工作人員，第二用戶和第三用戶可以是銀行客戶。第一用戶攜帶有能夠識別光標籤的設備，例如手機或者AR眼鏡。 Figure 2 shows a real scene including a system for setting presentable virtual objects for a target according to an embodiment. The real scene can be, for example, a bank branch, in which there are three users, namely the first user and the second user. The user and the third user, the first user may be a bank branch staff, and the second user and the third user may be bank customers. The first user carries a device capable of recognizing the optical tag, such as a mobile phone or AR glasses.

所述系統包括攝像頭、光標籤以及伺服器(圖2中未示出)，其中，攝像頭和光標籤各自以特定的位置和姿態(下文中可以統稱為“位姿”)安裝於現實場景中。在一個實施例中，伺服器可以獲得攝像頭和光標籤各自的位姿訊號，並且可以基於攝像頭和光標籤各自的位姿訊號得到攝像頭和光標籤之間的相對位姿訊號。在一個實施例中，伺服器也可以直接得到攝像頭和光標籤之間的相對位姿訊號。如此，伺服器可以獲得攝像頭座標系和光標籤座標系之間的變換矩陣，該變換矩陣例如可以包括兩個座標系之間的旋轉矩陣R和位移向量t。通過攝像頭座標系和光標籤座標系之間的變換矩陣，可以將一個座標系中的座標轉換為另一個座標系中的座標。在一個實施例中，可以在安裝攝像頭和光標籤時，對兩者的位姿訊號進行人工標定，並將位姿訊號存儲於伺服器。攝像頭可以是安裝於固定位置並且具有固定朝向的攝像頭，但可以理解，攝像頭也可以是可移動(例如，可以改變位置或調整方向)的攝像頭，只要能夠確定其當前位姿訊號即可。可以由伺服器設置攝像頭的當前位姿訊號，並基於該位姿訊號控制攝像頭的移動，也可以由攝像頭本身或者其他裝置來控制攝像頭的移動，並將攝像頭的當前位姿訊號發送給伺服器。在一些實施例中，系統中可以包括不止一個攝像頭，也可以包括不止一個光標籤。 The system includes a camera, an optical tag, and a server (not shown in FIG. 2), wherein the camera and the optical tag are each installed in a real scene in a specific position and posture (hereinafter collectively referred to as "pose"). In one embodiment, the server may obtain the respective pose signals of the camera and the optical tag, and may obtain the relative pose signals between the camera and the optical tag based on the respective pose signals of the camera and the optical tag. In one embodiment, the server can also directly obtain the relative pose signal between the camera and the optical tag. In this way, the server can obtain a transformation matrix between the camera coordinate system and the optical label coordinate system, and the transformation matrix may include, for example, a rotation matrix R and a displacement vector t between the two coordinate systems. Through the transformation matrix between the camera coordinate system and the optical label coordinate system, the coordinates in one coordinate system can be converted to the coordinates in another coordinate system. In one embodiment, when the camera and the optical tag are installed, the pose signals of the two can be manually calibrated, and the pose signals can be stored in the server. The camera may be a camera that is installed at a fixed position and has a fixed orientation. However, it is understood that the camera may also be a movable (for example, the position or direction can be changed), as long as the current position and attitude signal can be determined. The current pose signal of the camera can be set by the server, and the movement of the camera can be controlled based on the pose signal, or the movement of the camera can be controlled by the camera itself or other devices, and the current pose signal of the camera can be sent to the server. In some embodiments, the system may include more than one camera, or more than one optical tag.

在一個實施例中，可以為現實場景建立一個場景座標系(其也可稱為現實世界座標系)，並且可以基於攝像頭在現實場景中的位姿訊號來確定攝像頭座標系與場景座標系之間的變換矩陣，以及基於光標籤在現實場景中的位姿訊號來確定光標籤座標系與場景座標系之間的變換矩陣。在這種情況下，可以將攝像頭座標系或光標籤座標系中的座標轉換為場景座標系中的座標，而不在攝像頭座標系和光標籤座標系之間進行變換，但是可以理解，攝像頭和光標籤之間的相對位姿訊號或變換矩陣仍然能夠被伺服器知悉。因此，在本申請中，在攝像頭和光標籤之間具有相對位姿是指客觀上在兩者之間存在相對位姿，而並不要求系統預先存儲上述兩者之間的相對位姿訊號或者使用該相對位姿訊號。例如，一個實施例中，在系統中可以僅存儲攝像頭和光標籤各自在場景座標系中的位姿訊號，並且可以不計算或者使用兩者的相對位姿。 In one embodiment, a scene coordinate system (which may also be referred to as a real world coordinate system) can be established for the real scene, and the camera coordinate system and the scene coordinate system can be determined based on the pose signal of the camera in the real scene The transformation matrix of the light tag and the transformation matrix between the light tag coordinate system and the scene coordinate system are determined based on the pose signal of the light tag in the real scene. In this case, the coordinates in the camera coordinate system or the optical label coordinate system can be converted to the coordinates in the scene coordinate system, instead of transforming between the camera coordinate system and the optical label coordinate system, but it is understandable that the camera and the optical label The relative pose signal or transformation matrix can still be known by the server. Therefore, in this application, having a relative pose between the camera and the light tag means that there is a relative pose between the two objectively, and does not require the system to pre-store the relative pose signal between the two or use The relative pose signal. For example, in one embodiment, only the pose signals of the camera and the light tag in the scene coordinate system may be stored in the system, and the relative poses of the two may not be calculated or used.

攝像頭用於跟蹤現實場景中的目標，該目標可以是靜止的或移動的，其例如可以是場景中的人員、靜止物體、可移動的物體、等等。對於圖2所示的場景，系統可以通過攝像頭來跟蹤其中的第一用戶、第二用戶和第三用戶的位置。攝像頭例如可以是單眼攝像頭、雙眼攝像頭、或者其他形式的攝像頭。可以使用攝像頭通過現有技術中的各種方法來跟蹤現實場景中的人或物體的位置。例如，對於使用單個單目攝像頭的情況，可以結合場景訊號(例如，場景中的人或物體所處的平面的訊號)來確定場景中的目標的位置訊號。對於使用雙眼攝像頭的情況，可以根據目標在攝像頭視野中的位置以及目標的深度訊號，來確定目標的位置訊號。對於使用多個攝像頭的情況，可以根據目標在各個攝像頭視野中的位置，來確定目標的位置訊號。 The camera is used to track a target in a real scene. The target may be stationary or moving. For example, it may be a person, a stationary object, a movable object, etc. in the scene. For the scene shown in Figure 2, the system can track the positions of the first user, the second user, and the third user through a camera. The camera may be, for example, a monocular camera, a binocular camera, or other forms of camera. The camera can be used to track the position of a person or an object in a real scene through various methods in the prior art. For example, in the case of using a single monocular camera, the scene signal (for example, the signal of the plane where the person or object in the scene is located) can be combined to determine the position signal of the target in the scene. In the case of using a binocular camera, the position signal of the target can be determined according to the position of the target in the field of view of the camera and the depth signal of the target. In the case of using multiple cameras, the position signal of the target can be determined according to the position of the target in the field of view of each camera.

在一個實施例中，確定場景中的目標的位置訊號的過程可以由攝像頭來執行，並可以將相應的結果發送給伺服器。在另一個實施例中，可以由伺服器根據攝像頭拍攝的圖像來確定場景中的目標的位置訊號。伺服器可以將所確定的目標的位置訊號轉換為在光標籤座標系或場景座標系下的位置訊號。在獲得了目標的位置訊號之後，伺服器可以設置與目標相關聯的具有空間位置訊號的虛擬對象，虛擬對象的空間位置訊號可以基於目標的位置訊號來確定。例如，可以將虛擬對象的空間位置設置為位於相應目標的正上方，或者位於目標附近的其他位置。 In one embodiment, the process of determining the position signal of the target in the scene can be executed by the camera, and the corresponding result can be sent to the server. In another embodiment, the server may determine the position signal of the target in the scene according to the image taken by the camera. The server can convert the determined position signal of the target into a position signal in the optical label coordinate system or the scene coordinate system. After obtaining the position signal of the target, the server can set the virtual object with spatial position signal associated with the target, and the space of the virtual object The location signal can be determined based on the location signal of the target. For example, the spatial position of the virtual object can be set to be directly above the corresponding target, or to be located in another location near the target.

在設置了虛擬對象之後，伺服器可以將虛擬對象的相關訊號發送給某個能夠識別光標籤的設備。對於圖2所示的場景，伺服器可以將與第二用戶關聯的虛擬對象以及與第三用戶關聯的虛擬對象發送給第一用戶攜帶的光標籤識別設備。光標籤識別設備上具有攝像頭和顯示媒介，識別設備通過對光標籤進行掃描，可以確定其相對於光標籤的位置和姿態訊號，該位置和姿態訊號也可以被進一步轉換為場景座標系下的位置和姿態訊號。之後，識別設備能夠使用虛擬對象的相關訊號以基於其位置訊號和姿態訊號在其顯示媒介上的合適位置處呈現相應的虛擬對象。 After the virtual object is set, the server can send the relevant signal of the virtual object to a device that can recognize the optical tag. For the scene shown in FIG. 2, the server may send the virtual object associated with the second user and the virtual object associated with the third user to the optical tag recognition device carried by the first user. The optical label recognition device has a camera and a display medium. The recognition device can determine the position and posture signal relative to the optical label by scanning the optical label. The position and posture signal can also be further converted into the position under the scene coordinate system. And gesture signals. After that, the recognition device can use the relevant signal of the virtual object to present the corresponding virtual object at a suitable position on its display medium based on its position signal and posture signal.

可以理解，第二用戶或第三用戶也可以攜帶光標籤識別設備，並可以以與第一用戶類似的方式通過光標籤識別設備的顯示媒介觀察到與其他用戶關聯的虛擬對象。本發明的方案特別適合於場景中的多人AR交互。 It can be understood that the second user or the third user may also carry the optical tag identification device, and can observe virtual objects associated with other users through the display medium of the optical tag identification device in a similar manner to the first user. The solution of the present invention is particularly suitable for multi-person AR interaction in the scene.

圖3示出了根據一個實施例的用於為目標設置可呈現的虛擬對象的方法，其可以使用上述系統來實現，並且可以包括如下步驟： Fig. 3 shows a method for setting a presentable virtual object for a target according to an embodiment, which can be implemented using the above-mentioned system, and can include the following steps:

步驟301：使用攝像頭跟蹤現實場景中的目標。 Step 301: Use the camera to track the target in the real scene.

通過使用攝像頭針對現實場景持續地採集圖像，可以對現實場景中存在的目標進行視覺跟蹤。基於攝像頭的視覺跟蹤技術涉及對圖像序列中的目標進行檢測、提取、識別或跟蹤，以獲得目標的位置、姿態、速度、加速度或運動軌跡等。視覺跟蹤技術屬本領域的現有技術，在此不再贅述。 By using the camera to continuously collect images for the real scene, it is possible to visually track the target existing in the real scene. Camera-based visual tracking technology involves detecting, extracting, identifying, or tracking targets in image sequences to obtain the target's position, posture, velocity, acceleration, or motion trajectory. The visual tracking technology belongs to the existing technology in the field, and will not be repeated here.

在一個實施例中，攝像頭在跟蹤目標時，可以僅進行連續圖像採集，並將所採集的圖像作為跟蹤結果提供給伺服器，之後，可以由伺服器分析這些圖像並確定各個目標的位置訊號。在另一個實施例中，攝像頭還可以對所採集的圖像執行進一步的處理，例如圖像處理、目標檢測、目標提取、目標識別、確定目標位置或姿態等，相應的處理結果可以作為跟蹤結果被提供給伺服器。 In one embodiment, when the camera is tracking a target, it can only perform continuous image collection and provide the collected images to the server as the tracking result. After that, the server can analyze these images and determine the target's Position signal. In another embodiment, the camera can also perform further processing on the collected images, such as image processing, target detection, target extraction, target recognition, determination of target position or posture, etc., and the corresponding processing The result can be provided to the server as the tracking result.

步驟302：根據攝像頭的跟蹤結果獲得所述目標的位置訊號。 Step 302: Obtain the position signal of the target according to the tracking result of the camera.

伺服器可以從攝像頭接收跟蹤結果，並根據該跟蹤結果獲得現實場景中的目標的位置訊號。在一個實施例中，伺服器最終獲得的目標位置訊號可以是目標在攝像頭座標系下的位置訊號、目標在光標籤座標系下的位置訊號、或者是目標在場景座標系下的位置訊號。伺服器可以根據不同座標系之間的變換矩陣來實現目標位置在不同座標系之間的轉換。例如，伺服器可以首先根據攝像頭的跟蹤結果獲得目標相對於所述攝像頭的位置訊號(也即，在攝像頭座標系下的位置訊號)，然後可以根據目標相對於攝像頭的位置訊號以及攝像頭和光標籤之間的相對位姿訊號來確定目標相對於光標籤的位置訊號(也即，在光標籤座標系下的位置訊號)，也可以根據目標相對於攝像頭的位置訊號以及攝像頭在現實場景中的位姿訊號來確定目標在現實場景中的位置訊號(也即，在場景座標系下的位置訊號)。 The server may receive the tracking result from the camera, and obtain the position signal of the target in the real scene according to the tracking result. In one embodiment, the target position signal finally obtained by the server may be the target position signal in the camera coordinate system, the target position signal in the optical label coordinate system, or the target position signal in the scene coordinate system. The server can realize the conversion of the target position between different coordinate systems according to the transformation matrix between different coordinate systems. For example, the server can first obtain the position signal of the target relative to the camera (that is, the position signal in the camera coordinate system) according to the tracking result of the camera, and then can according to the position signal of the target relative to the camera and the difference between the camera and the optical label. To determine the position signal of the target relative to the light tag (that is, the position signal in the light tag coordinate system), it can also be based on the position signal of the target relative to the camera and the camera’s position in the real scene. The signal is used to determine the position signal of the target in the real scene (that is, the position signal in the scene coordinate system).

在一個實施例中，除了獲得目標的位置訊號之外，伺服器還可以根據攝像頭的跟蹤結果獲得目標的姿態訊號，例如，人或者物體的朝向等。 In one embodiment, in addition to obtaining the position signal of the target, the server may also obtain the posture signal of the target according to the tracking result of the camera, for example, the orientation of a person or an object.

步驟303：設置與所述目標相關聯的具有空間位置訊號的虛擬對象。 Step 303: Set up a virtual object with a spatial position signal associated with the target.

在確定了目標的位置訊號之後，伺服器可以為該目標設置相關聯的虛擬對象，該虛擬對象具有空間位置訊號，其可以基於目標的位置訊號來確定。例如，可以將虛擬對象的空間位置配置為位於目標位置上方的預定距離處。虛擬對象的空間位置訊號例如可以是其相對於光標籤的空間位置訊號，也可以是其在場景座標系中的位置訊號。 After determining the location signal of the target, the server can set an associated virtual object for the target. The virtual object has a spatial location signal, which can be determined based on the location signal of the target. For example, the spatial position of the virtual object may be configured to be located at a predetermined distance above the target position. The spatial position signal of the virtual object can be, for example, its spatial position signal relative to the light tag, or its position signal in the scene coordinate system.

在一個實施例中，伺服器除了配置虛擬對象的空間位置訊號之外，還可以配置任何其他與虛擬對象有關的訊號。與虛擬對象有關的訊號是用於描述該虛擬對象的相關訊號，例如可以包括虛擬對象中包含的圖片、文字、數字、圖標等，也可以包括虛擬對象的形狀訊號、顏色訊號、尺寸訊號、姿態訊號等。基於該訊號，設備可以呈現出相應的虛擬對象。伺服器可以根據與目標有關的訊號來配置對應於該目標的虛擬對象的相關訊號，如此，可以為各個目標定制相應的虛擬對象。 In one embodiment, in addition to configuring the spatial position signal of the virtual object, the server may also configure any other signals related to the virtual object. The signal related to the virtual object is the signal used to describe the virtual object. For example, it can include pictures, text, numbers, icons, etc. contained in the virtual object, and can also include the shape signal of the virtual object, Color signal, size signal, attitude signal, etc. Based on the signal, the device can present the corresponding virtual object. The server can configure the related signal of the virtual object corresponding to the target according to the signal related to the target, so that the corresponding virtual object can be customized for each target.

在一個實施例中，伺服器可以根據攝像頭的跟蹤結果來獲得與目標有關的訊號。例如，伺服器可以根據攝像頭的跟蹤結果判斷目標是人還是物體、目標是男人還是女人、目標是什麼物體、目標是移動的還是靜止的、目標的移動速度、目標的移動方向、等等。這些與目標有關的訊號可以用於配置對應於該目標的虛擬對象的相關訊號。 In one embodiment, the server can obtain the signal related to the target according to the tracking result of the camera. For example, the server can determine whether the target is a person or an object, whether the target is a man or a woman, what the target is, whether the target is moving or stationary, the moving speed of the target, the moving direction of the target, etc., according to the tracking result of the camera. These signals related to the target can be used to configure the related signals of the virtual object corresponding to the target.

在一個實施例中，也可以通過其他設施來獲得與目標有關的資訊。例如，可以通過指紋或靜脈採集設備、人臉識別設備、身份證讀取設備等設施來獲得人員的身份資訊，可以通過讀卡器等設施來獲得人員的性別資訊、職業資訊、身份資訊、會員卡資訊等。在這些設施獲得了人員資訊之後，伺服器可以從設施接收該資訊並將該資訊與攝像頭當前所跟蹤的位於預定位置(例如，這些設施的前方、後方、左側、右側、上方、下方、附近等)的人員建立聯繫，從而可以使用該人員的資訊(例如，性別資訊、職業資訊、身份資訊、會員卡資訊等)來配置與該人員對應的虛擬對象的相關資訊。類似地，也可以通過場景中的設施來獲得物體的相關資訊，並用該資訊配置與該物體對應的虛擬對象的相關資訊。 In one embodiment, other facilities may also be used to obtain information related to the target. For example, fingerprint or vein collection equipment, face recognition equipment, ID card reading equipment and other facilities can be used to obtain personnel’s identity information, and card readers and other facilities can be used to obtain personnel’s gender information, occupational information, identity information, and membership Card information, etc. After these facilities have obtained personnel information, the server can receive the information from the facilities and place the information at a predetermined location currently tracked by the camera (for example, the front, back, left, right, upper, lower, nearby, etc. of these facilities) ) To establish a connection, so that the person’s information (for example, gender information, occupation information, identity information, membership card information, etc.) can be used to configure the relevant information of the virtual object corresponding to the person. Similarly, the related information of the object can also be obtained through the facilities in the scene, and the related information of the virtual object corresponding to the object can be configured with the information.

在一個實施例中，伺服器還可以設置虛擬對象的姿態訊號，虛擬對象的姿態訊號可以基於目標的姿態訊號來設置，但也可以通過其他方式來設置。 In one embodiment, the server can also set the posture signal of the virtual object. The posture signal of the virtual object can be set based on the posture signal of the target, but it can also be set in other ways.

步驟304：將所述虛擬對象的相關訊號發送給能夠識別光標籤的設備，該訊號中包括所述虛擬對象的空間位置訊號。 Step 304: Send the relevant signal of the virtual object to a device capable of recognizing the optical tag, and the signal includes the spatial position signal of the virtual object.

在為目標設置了虛擬對象之後，伺服器可以將虛擬對象的相關訊號發送給能夠識別光標籤的設備，該設備例如可以是手機、AR眼鏡等，其具有攝像頭和顯示媒介。虛擬對象的相關訊號中包括虛擬對象的空間位置訊號，在一個實施例中，還可以包括虛擬對象的姿態訊號。 After setting the virtual object for the target, the server can send the relevant signal of the virtual object to a device capable of recognizing the optical tag. The device may be a mobile phone, AR glasses, etc., which has a camera and a display medium. The related signal of the virtual object includes the spatial position signal of the virtual object. In one embodiment, it may also include the posture signal of the virtual object.

在一個實施例中，可以根據與設備有關的訊號確定要發送給該設備的所述虛擬對象的相關訊號。與設備有關的訊號可以包括該設備本身的相關訊號、該設備的用戶的相關訊號、等等。例如，可以根據設備用戶的不同權限或者不同等級，選擇虛擬對象的一部分或者全部相關訊號發送給設備。 In one embodiment, the signal to be sent can be determined based on the signal related to the device. The relevant signal of the virtual object sent to the device. The signal related to the device may include the related signal of the device itself, the related signal of the user of the device, and so on. For example, according to different permissions or different levels of the device user, a part or all of the related signals of the virtual object can be selected and sent to the device.

在一個實施例中，光標籤識別設備可以通過掃描場景中佈置的光標籤來識別光標籤傳遞的訊號(例如標識訊號)，並使用該訊號訪問伺服器(例如，通過無線信號進行訪問)以從伺服器獲得虛擬對象的相關訊號。在一個實施例中，伺服器可以使用光標籤以光通訊方式將虛擬對象的相關訊號發送到光標籤識別設備。虛擬對象的相關訊號能夠被光標籤識別設備使用以基於其通過光標籤確定的位置訊號和姿態訊號在其顯示媒介上呈現所述虛擬對象。 In one embodiment, the optical label identification device can identify the signal (for example, identification signal) transmitted by the optical label by scanning the optical label arranged in the scene, and use the signal to access the server (for example, access via wireless signal) to obtain The server obtains the relevant signal of the virtual object. In one embodiment, the server may use the optical tag to send the relevant signal of the virtual object to the optical tag identification device in optical communication. The related signal of the virtual object can be used by the optical tag recognition device to present the virtual object on its display medium based on its position signal and posture signal determined by the optical tag.

可以通過光標籤來確定設備的位置和姿態訊號，該位置和姿態訊號可以是設備相對於光標籤的位置和姿態訊號(也即，在光標籤座標系下的位置和姿態訊號)，但也可以是被轉換到其他座標系(例如場景座標系)下的位置和姿態訊號。 The position and attitude signal of the device can be determined by the optical tag. The position and attitude signal can be the position and attitude signal of the device relative to the optical tag (that is, the position and attitude signal in the optical tag coordinate system), but it can also be It is the position and attitude signal converted to other coordinate system (such as scene coordinate system).

在一個實施例中，設備可以通過採集包括光標籤的圖像並分析該圖像來確定其相對於光標籤的位置訊號。例如，設備可以通過圖像中的光標籤成像大小以及可選的其他訊號(例如，光標籤的實際物理尺寸訊號、設備的攝像頭的焦距)來確定光標籤與設備的相對距離(成像越大，距離越近；成像越小，距離越遠)。設備可以使用光標籤的標識訊號從伺服器獲得光標籤的實際物理尺寸訊號，或者光標籤可以具有統一的物理尺寸並在設備上存儲該物理尺寸。設備可以通過包括光標籤的圖像中的光標籤成像的透視變形以及可選的其他訊號(例如，光標籤的成像位置)，來確定設備相對於光標籤的方向訊號。設備可以使用光標籤的標識訊號從伺服器獲得光標籤的物理形狀訊號，或者光標籤可以具有統一的物理形狀並在設備上存儲該物理形狀。在一個實施例中，設備也可以通過其上安裝的深度攝像頭或雙眼攝像頭等來直接獲得光標籤與設備的相對距離。設備也可以採用現有的任何其他定位方法來確定其相對於光標籤的位置訊號。 In one embodiment, the device can determine its position signal relative to the optical label by collecting an image including the optical label and analyzing the image. For example, the device can determine the relative distance between the optical label and the device through the image size of the optical label in the image and other optional signals (for example, the actual physical size signal of the optical label, the focal length of the device's camera) (the larger the image, the The closer the distance; the smaller the imaging, the farther the distance). The device may use the identification signal of the optical label to obtain the actual physical size signal of the optical label from the server, or the optical label may have a uniform physical size and store the physical size on the device. The device can determine the direction signal of the device relative to the optical label through the perspective deformation of the optical label imaging in the image including the optical label and optional other signals (for example, the imaging position of the optical label). The device may use the identification signal of the optical label to obtain the physical shape signal of the optical label from the server, or the optical label may have a uniform physical shape and store the physical shape on the device. In an embodiment, the device can also directly obtain the relative distance between the optical tag and the device through a depth camera or binocular camera installed on it. The device can also use any other existing positioning method to determine its position signal relative to the optical tag.

在一個實施例中，設備可以掃描光標籤，並可以根據光標籤的成像來確定其相對於光標籤的姿態訊號，當光標籤的成像位置或成像區域位於第二設備成像視野的中心時，可以認為第二設備當前正對著光標籤。在確定設備的姿態時可以進一步考慮光標籤的成像的方向。隨著設備的姿態發生改變，光標籤在設備上的成像位置和/或成像方向會發生相應的改變，因此，可以根據光標籤在設備上的成像來獲得設備相對於光標籤的姿態訊號。 In one embodiment, the device can scan the optical tag and determine its attitude signal relative to the optical tag based on the imaging of the optical tag. When the imaging position or imaging area of the optical tag is located in the center of the imaging field of the second device, it can It is assumed that the second device is currently facing the optical tag. When determining the posture of the device, the imaging direction of the optical tag can be further considered. As the posture of the device changes, the imaging position and/or imaging direction of the optical tag on the device will change accordingly. Therefore, the posture signal of the device relative to the optical tag can be obtained according to the imaging of the optical tag on the device.

在一個實施例中，也可以以如下方式來確定設備相對於光標籤的位置和姿態訊號。具體地，可以根據光標籤建立一個座標系，該座標系可以被稱為光標籤座標系。可以將光標籤上的一些點確定為在光標籤座標系中的一些空間點，並且可以根據光標籤的物理尺寸訊號和/或物理形狀訊號來確定這些空間點在光標籤座標系中的座標。光標籤上的一些點例如可以是光標籤的外殼的角、光標籤中的光源的端部、光標籤中的一些標識點、等等。根據光標籤的物體結構特徵或幾何結構特徵，可以在設備相機拍攝的圖像中找到與這些空間點分別對應的像點，並確定各個像點在圖像中的位置。根據各個空間點在光標籤座標系中的座標以及對應的各個像點在圖像中的位置，結合設備相機的內參訊號，可以計算得到拍攝該圖像時設備相機在光標籤座標系中的位姿訊號(R，t)，其中R為旋轉矩陣，其可以用於表示設備相機在光標籤座標系中的姿態訊號，t為位移向量，其可以用於表示設備相機在光標籤座標系中的位置訊號。計算R、t的方法在現有技術中是已知的，例如，可以利用3D-2D的PnP(Perspective-n-Point)方法來計算R、t，為了不模糊本發明，在此不再詳細介紹。 In an embodiment, the position and attitude signal of the device relative to the optical tag can also be determined in the following manner. Specifically, a coordinate system can be established based on the optical label, and this coordinate system can be referred to as the optical label coordinate system. Some points on the optical label can be determined as some spatial points in the optical label coordinate system, and the coordinates of these spatial points in the optical label coordinate system can be determined according to the physical size signal and/or physical shape signal of the optical label. Some points on the optical label may be, for example, the corners of the housing of the optical label, the end of the light source in the optical label, some identification points in the optical label, and so on. According to the object structure feature or geometric structure feature of the optical tag, the image points corresponding to these spatial points can be found in the image taken by the device camera, and the position of each image point in the image can be determined. According to the coordinates of each spatial point in the optical label coordinate system and the position of each corresponding image point in the image, combined with the internal reference signal of the device camera, the position of the device camera in the optical label coordinate system can be calculated when the image is taken. Attitude signal (R, t), where R is the rotation matrix, which can be used to represent the posture signal of the device camera in the optical label coordinate system, and t is the displacement vector, which can be used to represent the device camera’s position in the optical label coordinate system Position signal. The method of calculating R and t is known in the prior art. For example, the 3D-2D PnP (Perspective-n-Point) method can be used to calculate R and t. In order not to obscure the present invention, it will not be described in detail here. .

在設備通過光標籤獲得了位置和姿態訊號之後，可以例如使用設備內置的加速度傳感器、陀螺儀、攝像頭等通過本領域已知的方法(例如，慣性導航、視覺里程計、SLAM、VSLAM、SFM等)來測量或跟蹤設備的位置和姿態變化，從而獲得設備新的位置和姿態訊號。 After the device obtains the position and attitude signals through the optical tag, for example, the built-in acceleration sensor, gyroscope, camera, etc. of the device can be used through methods known in the art (for example, inertial navigation, visual odometry, SLAM, VSLAM, SFM, etc.) ) To measure or track changes in the device's position and attitude, so as to obtain a new position and attitude signal of the device.

在獲得了虛擬對象的空間位置訊號以及設備的位置和姿態訊號之後，便可以在通過設備的顯示媒介呈現的現實場景中的合適位置處疊加該虛擬對象。在虛擬對象具有姿態訊號的情況下，可以進一步確定所疊加的虛擬對象的姿態。虛擬對象的姿態可以隨著設備相對於虛擬對象的位置和/姿態而調整，例如使得虛擬對象的某個方向(例如虛擬對象的正面方向)始終朝向設備。在一個實施例中，可以基於設備和虛擬對象的位置在空間中確定一個從虛擬對象到設備的方向，並基於該方向來確定虛擬對象的姿態。通過上述方法，同一個虛擬對象對於處於不同位置的設備實際上可以具有各自的姿態。 After obtaining the spatial position signal of the virtual object and the position and posture signal of the device, it can be positioned in the appropriate position in the real scene presented by the display medium of the device. Place the virtual object superimposed. In the case that the virtual object has a posture signal, the posture of the superimposed virtual object can be further determined. The posture of the virtual object can be adjusted with the position and/or posture of the device relative to the virtual object, for example, a certain direction of the virtual object (for example, the front direction of the virtual object) always faces the device. In one embodiment, a direction from the virtual object to the device may be determined in space based on the positions of the device and the virtual object, and the pose of the virtual object may be determined based on the direction. Through the above method, the same virtual object can actually have their own postures for devices in different positions.

在一個實施例中，在疊加了虛擬對象之後，設備用戶可以對該虛擬對象執行各種交互操作。例如，設備用戶可以點擊虛擬對象以查閱其詳情、改變虛擬對象的姿態、改變虛擬對象的大小或顏色、在虛擬對象上添加標注、等等。在一個實施例中，在設備用戶改變了虛擬對象的屬性之後，可以把虛擬對象的修改後的屬性訊號上傳到伺服器。伺服器可以基於修改後的屬性訊號來更新其存儲的虛擬對象的相關訊號。 In one embodiment, after the virtual object is superimposed, the device user can perform various interactive operations on the virtual object. For example, the device user can click on the virtual object to view its details, change the posture of the virtual object, change the size or color of the virtual object, add annotations on the virtual object, and so on. In one embodiment, after the device user changes the attribute of the virtual object, the modified attribute signal of the virtual object can be uploaded to the server. The server can update the related signal of the virtual object stored in it based on the modified attribute signal.

圖3所示的方法可以持續執行或者定期重複執行，使得虛擬對象能夠始終跟蹤場景中的目標。 The method shown in FIG. 3 can be executed continuously or repeatedly executed periodically, so that the virtual object can always track the target in the scene.

在一個實施例中，位於現實場景中的跟蹤目標可以是設備或者持有該設備的人，並可以根據該設備的相關訊號來設置與相應虛擬對象有關的訊號。具體地，設備可以例如使用上文提到的方式通過光標籤來確定其位置訊號，並將該位置訊號發送到伺服器，該位置訊號可以是設備相對於光標籤的位置訊號。設備所發送的位置訊號可以是設備在掃描光標籤時獲得的位置訊號，也可以是設備在掃描光標籤之後使用內置的加速度傳感器、陀螺儀、攝像頭等通過本領域已知的方法(例如，慣性導航、視覺里程計、SLAM、VSLAM、SFM等)測量或跟蹤獲得的新的位置訊號。另外，設備還可以將一些其他訊號發送給伺服器，例如，設備標識號、設備用戶名、設備所有人的職業資訊、設備所有人的身份資訊、設備所有人的性別資訊、設備所有人的年齡資訊、設備上某個應用的賬戶資訊、與設備所執行的某個操作有關的訊號、等等。伺服器可以將設備的位置訊號與根據攝像頭的跟蹤結果確定的一個或多個目標的位置訊號進行比較，以確定與設備匹配的目標。例如，伺服器可以從一個或多個目標的位置訊號中選擇與設備的位置訊號最接近的一個，並認為相應的目標與設備匹配。在確定了與設備匹配的目標之後，伺服器可以根據來自設備的訊號配置對應於該目標的虛擬對象的相關訊號。通過上述方式，場景中的設備(例如，機器人、無人車等)或者持有設備(例如，手機、AR眼鏡等)的人員可以自主地向伺服器上傳相關訊號，從而不再需要使用專門的設施(例如，刷卡器、指紋採集設備等)來獲得與設備或人員有關的訊號。 In one embodiment, the tracking target in the real scene may be a device or a person holding the device, and a signal related to the corresponding virtual object may be set according to the related signal of the device. Specifically, the device can, for example, use the above-mentioned method to determine its location signal through the optical tag, and send the location signal to the server. The location signal can be the location signal of the device relative to the optical tag. The position signal sent by the device can be the position signal obtained when the device scans the optical tag, or it can be the device after scanning the optical tag using a built-in acceleration sensor, gyroscope, camera, etc. through methods known in the art (for example, inertial Navigation, visual odometer, SLAM, VSLAM, SFM, etc.) measure or track the new position signal obtained. In addition, the device can also send some other signals to the server, such as device identification number, device user name, device owner’s occupational information, device owner’s identity information, device owner’s gender information, and device owner’s age Information, account information of an application on the device, signals related to an operation performed by the device, etc. The server can combine the position signal of the device with one or more targets determined according to the tracking result of the camera. The location signal of the target is compared to determine the target that matches the device. For example, the server may select the position signal of one or more targets that is closest to the position signal of the device, and consider that the corresponding target matches the device. After determining the target matching the device, the server can configure the relevant signal of the virtual object corresponding to the target according to the signal from the device. Through the above method, the equipment in the scene (for example, robots, unmanned vehicles, etc.) or persons with equipment (for example, mobile phones, AR glasses, etc.) can upload relevant signals to the server autonomously, eliminating the need to use special facilities (For example, card swiping devices, fingerprint collection devices, etc.) to obtain signals related to equipment or personnel.

圖4示出了根據一個實施例的由圖2中的第一用戶通過手機螢幕或者AR眼鏡觀察到的示意圖像，其中包含第二用戶和第三用戶。如上文提到的，第一用戶可以是銀行網點工作人員，第二用戶和第三用戶可以是銀行客戶。如圖4所示，第一用戶通過手機螢幕或者AR眼鏡不僅能夠觀察到第二用戶和第三用戶，還可以觀察到各個用戶頭上疊加的虛擬對象(例如，“VIP”、“普通”等)，這些對象可以用來指示用戶的客戶等級，例如，是VIP客戶還是普通客戶。用戶的客戶等級例如可以通過讀取用戶的身份資訊、銀行卡資訊等來獲取，也可以通過從用戶手機接收其電話號碼資訊、銀行賬戶資訊等來獲取。 Fig. 4 shows a schematic image observed by the first user in Fig. 2 through a mobile phone screen or AR glasses according to an embodiment, which includes a second user and a third user. As mentioned above, the first user may be a bank branch staff, and the second and third users may be bank customers. As shown in Figure 4, the first user can not only observe the second user and the third user through the mobile phone screen or AR glasses, but also the virtual objects superimposed on each user's head (for example, "VIP", "normal", etc.) , These objects can be used to indicate the user’s customer level, for example, whether it is a VIP customer or an ordinary customer. The customer level of the user can be obtained, for example, by reading the user's identity information, bank card information, etc., or by receiving the user's phone number information, bank account information, etc. from the user's mobile phone.

在本發明的一個實施例中，可以以計算機程式的形式來實現本發明。計算機程式可以存儲於各種存儲介質(例如，硬碟、光碟、快閃記憶體等)中，當該計算機程式被處理器執行時，能夠用於實現本發明的方法。 In an embodiment of the present invention, the present invention can be implemented in the form of a computer program. The computer program can be stored in various storage media (for example, a hard disk, an optical disk, a flash memory, etc.), and when the computer program is executed by a processor, it can be used to implement the method of the present invention.

在本發明的另一個實施例中，可以以電子設備的形式來實現本發明。該電子設備包括處理器和存儲器，在存儲器中存儲有計算機程式，當該計算機程式被處理器執行時，能夠用於實現本發明的方法。 In another embodiment of the present invention, the present invention may be implemented in the form of an electronic device. The electronic device includes a processor and a memory, and a computer program is stored in the memory. When the computer program is executed by the processor, it can be used to implement the method of the present invention.

本文中針對“各個實施例”、“一些實施例”、“一個實施例”、或“實施例”等的參考指代的是結合所述實施例所描述的特定特徵、結構、或性質包括在至少一個實施例中。因此，短語“在各個實施例中”、“在一些實施例中”、“在一個實施例中”、或“在實施例中”等在整個本文中各處的出現並非必須指代相同的實施例。此外，特定特徵、結構、或性質可以在一個或多個實施例中以任何合適方式組合。因此，結合一個實施例中所示出或描述的特定特徵、結構或性質可以整體地或部分地與一個或多個其他實施例的特徵、結構、或性質無限制地組合，只要該組合不是不符合邏輯的或不能工作。本文中出現的類似於“根據A”、“基於A”、“通過A”或“使用A”的表述意指非排他性的，也即，“根據A”可以涵蓋“僅僅根據A”，也可以涵蓋“根據A和B”，除非特別聲明其含義為“僅僅根據A”。在本申請中為了清楚說明，以一定的順序描述了一些示意性的操作步驟，但本領域技術人員可以理解，這些操作步驟中的每一個並非是必不可少的，其中的一些步驟可以被省略或者被其他步驟替代。這些操作步驟也並非必須以所示的方式依次執行，相反，這些操作步驟中的一些可以根據實際需要以不同的順序執行，或者並行執行，只要新的執行方式不是不符合邏輯的或不能工作。 References herein to "each embodiment", "some embodiments", "one embodiment", or "an embodiment", etc. refer to the specific features, structures, or properties described in connection with the embodiments that are included in In at least one embodiment. Therefore, the phrases "in various embodiments", "in some embodiments", "in one embodiment", or "in implementation The appearances of "in the example" in various places throughout this text do not necessarily refer to the same embodiment. In addition, specific features, structures, or properties can be combined in any suitable manner in one or more embodiments. Therefore, a combination of one implementation The specific features, structures, or properties shown or described in the examples can be combined in whole or in part with the features, structures, or properties of one or more other embodiments without limitation, as long as the combination is not illogical or impossible. Work. Expressions similar to "according to A", "based on A", "through A" or "using A" appearing in this article mean non-exclusive, that is, "according to A" can cover "only according to A", It can also cover "according to A and B", unless it is specifically stated that its meaning is "only according to A." In this application, for clarity, some illustrative operating steps are described in a certain order, but those skilled in the art can understand Each of these operating steps is not indispensable. Some of these steps can be omitted or replaced by other steps. These operating steps do not have to be performed sequentially in the manner shown. On the contrary, some of these operating steps It can be executed in a different order or in parallel according to actual needs, as long as the new execution method is not illogical or unable to work.

由此描述了本發明的至少一個實施例的幾個方面，可以理解，對本領域技術人員來說容易地進行各種改變、修改和改進。這種改變、修改和改進意於在本發明的精神和範圍內。雖然本發明已經通過優選實施例進行了描述，然而本發明並非局限於這裡所描述的實施例，在不脫離本發明範圍的情況下還包括所作出的各種改變以及變化。 Thus, several aspects of at least one embodiment of the present invention have been described, and it can be understood that various changes, modifications and improvements can be easily made by those skilled in the art. Such changes, modifications and improvements are intended to be within the spirit and scope of the present invention. Although the present invention has been described through preferred embodiments, the present invention is not limited to the embodiments described here, and also includes various changes and changes made without departing from the scope of the present invention.

雖然本發明以前述實施例揭露如上，然其並非用以限定本發明，任何熟習相像技藝者，在不脫離本發明之精神和範圍內，所作更動與潤飾之等效替換，仍為本發明之專利保護範圍內。 Although the present invention is disclosed in the foregoing embodiments as above, it is not intended to limit the present invention. Anyone who is familiar with similar skills, without departing from the spirit and scope of the present invention, makes changes and modifications equivalent to replacements, still belongs to the present invention. Within the scope of patent protection.

S301~S304:步驟流程 S301~S304: Step flow

Claims

A method for setting a presentable virtual object for a target in a real scene, wherein a camera and an optical communication device are installed in the real scene, and the camera and the optical communication device have a relative pose, the method includes :

Use the camera to track a target in the real scene;

Obtain a position signal of the target according to the tracking result of the camera;

Setting a virtual object with a spatial position signal associated with the target, wherein a spatial position signal of the virtual object is determined based on the position signal of the target; and

The signal related to the virtual object is sent to a first device, and the signal includes the spatial position signal of the virtual object, wherein the signal related to the virtual object can be used by the first device to pass through the optical communication device A determined position signal and a posture signal present the virtual object on its display medium.

The method described in item 1 of the scope of patent application, wherein the obtaining the position signal of the target according to the tracking result of the camera includes:

Obtain the position signal of the target relative to the camera according to the tracking result of the camera, and

The position signal of the target relative to the optical communication device is determined according to the position signal of the target relative to the camera and the relative pose between the camera and the optical communication device.

The method described in item 2 of the scope of patent application, wherein the spatial position signal of the virtual object is relative to the spatial position signal of the optical communication device.

The method described in item 1 of the scope of patent application, wherein obtaining the position signal of the target according to the tracking result of the camera includes:

The position signal of the target in the real scene is determined according to the position signal of the target relative to the camera and the pose signal of the camera in the real scene.

The method according to any one of items 1 to 4 in the scope of the patent application, wherein the signal related to the virtual object is configured according to the signal related to the target.

As the method described in item 5 of the scope of patent application, it also includes: according to the tracking result of the camera Obtain a signal related to the target.

For example, the method described in item 5 of the scope of patent application, wherein the related signal of configuring the virtual object according to the signal related to the target includes:

Receive signals related to the target from the facility;

Establishing contact with the signal and the target tracked by the camera at a predetermined position relative to the facility; and

According to the signal, the relevant signal of the virtual object of the target is configured.

Receiving a signal from a second device, the signal including a location signal of the second device;

Comparing the position signal of the second device with the position signal of one or more targets determined according to the tracking result of the camera to determine a target matching the second device; and

According to the signal from the second device, the related signal of the virtual object of the target matched with the second device is configured.

According to the method described in item 8 of the scope of patent application, the second device determines its position signal at least in part by collecting an image including the optical communication device and analyzing the image.

The method described in any one of items 1-4 in the scope of the patent application also includes:

Obtain the posture signal of the target according to the tracking result of the camera;

And wherein, the setting of the virtual object with the spatial position signal associated with the target includes setting the posture signal of the virtual object according to the posture signal of the target.

The method according to any one of items 1-4 in the scope of the patent application, wherein the first device at least partially collects an image including the optical communication device and analyzes the image to determine that it is relative to the optical communication device. The position signal and the attitude signal of the device.

A system for setting presentable virtual objects for targets in real scenes, including:

A camera installed in the real scene, which is used to track the target in the real scene;

An optical communication device installed in the real scene, wherein the optical communication device and the camera have relative poses; and

A server used to:

Obtain the position signal of the target according to the tracking result of the camera;

Set a virtual object associated with the target with a spatial position signal, where the virtual object The spatial position signal of the proposed object is determined based on the position signal of the target;

The signal related to the virtual object is sent to the first device, and the signal includes a spatial position signal of the virtual object, wherein the signal related to the virtual object can be used by the first device to determine it based on the optical communication device The position signal and posture signal of the virtual object are displayed on the display medium.

For example, the system described in item 12 of the scope of patent application, wherein the obtaining the position signal of the target according to the tracking result of the camera includes:

For the system described in any one of items 12-14 in the scope of the patent application, the first device is used for:

Receiving the relevant signal of the virtual object from the server;

Determining the position signal and the attitude signal of the first device at least in part through the optical communication device; and

The virtual object is presented on the display medium of the first device based on the position signal, the posture signal and the related signal of the virtual object.

A storage medium in which a computer program is stored. When the computer program is executed by the processor, it can be used to implement the method described in any one of items 1-11 in the scope of the patent application.

An electronic device, comprising a processor and a memory, the memory stores a computer program, and when the computer program is executed by the processor, it can be used to implement any one of items 1-11 in the scope of the patent application的方法。 The method.