CN111163281A - Panoramic video recording method and device based on voice tracking - Google Patents
Panoramic video recording method and device based on voice tracking Download PDFInfo
- Publication number
- CN111163281A CN111163281A CN202010021698.0A CN202010021698A CN111163281A CN 111163281 A CN111163281 A CN 111163281A CN 202010021698 A CN202010021698 A CN 202010021698A CN 111163281 A CN111163281 A CN 111163281A
- Authority
- CN
- China
- Prior art keywords
- video
- image
- panoramic video
- panoramic
- audio
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/15—Conference systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/222—Studio circuitry; Studio devices; Studio equipment
- H04N5/262—Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
- H04N5/2624—Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects for obtaining an image which is composed of whole input images, e.g. splitscreen
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/76—Television signal recording
- H04N5/765—Interface circuits between an apparatus for recording and another apparatus
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/18—Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast
- H04N7/181—Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast for receiving images from a plurality of remote sources
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Studio Devices (AREA)
Abstract
The invention relates to a panoramic video recording method and a panoramic video recording device based on voice tracking, wherein a plurality of paths of audio signals and a plurality of paths of video signals are collected, and the plurality of paths of video signals are fused and spliced through a panoramic video to form a panoramic video image; estimating the sound source direction of a live speaker in real time according to the audio signal; intercepting a live speaker close-up image at a corresponding position in the panoramic video image according to the sound source direction, and integrating the live speaker close-up image and the panoramic video image to form a panoramic video output image; and uploading the audio signal and the panoramic video output image to an upper computer through a network or directly outputting the audio signal and the panoramic video output image through monitoring equipment. The method has simple flow, can effectively realize automatic generation of the panoramic image and the close-up image, and has real-time property.
Description
Technical Field
The invention relates to a panoramic video recording method and device based on voice tracking.
Background
In the prior art, most video conference equipment for panoramic video is complex in composition, manual switching is needed for recording and broadcasting of speakers, and automatic generation of panoramic images and close-up images cannot be realized. The prior art most relevant to the invention is a patent of invention named as a conference transcription system based on a panoramic camera and a microphone array (patent publication No. CN 109474797A), and the technical scheme has the defects of complex structure, complex flow of generating a panoramic image and an automatic close-up image and poor real-time property.
Disclosure of Invention
The invention aims to provide a panoramic video recording method and device based on voice tracking, which can effectively realize automatic generation of panoramic images and close-up images.
Based on the same inventive concept, the invention has two independent technical schemes:
1. a panoramic video recording method based on voice tracking is characterized by comprising the following steps:
step 1: collecting a plurality of paths of audio signals and a plurality of paths of video signals, and fusing and splicing the plurality of paths of video signals through panoramic videos to form a panoramic video image;
step 2: estimating the sound source direction of a live speaker in real time according to the audio signal; intercepting a live speaker close-up image at a corresponding position in the panoramic video image according to the sound source direction, and integrating the live speaker close-up image and the panoramic video image to form a panoramic video output image;
and step 3: and uploading the audio signal and the panoramic video output image to an upper computer through a network or directly outputting the audio signal and the panoramic video output image through monitoring equipment.
Further, step 3 further comprises: carrying out face recognition on the close-up image of the live speaker to recognize the identity of the speaker; and identifying the audio signal, converting the voice into characters, storing the data, and labeling the identity of the speaker to the data.
Further, the multi-channel audio signal is collected by a microphone array, and the multi-channel video signal is collected by a multi-channel video sensor.
Furthermore, the microphone array is composed of a plurality of microphones, wherein 1 microphone is positioned at the position of the circle center, and the rest microphones are uniformly distributed along the circumferential direction;
the multiple paths of video sensors are uniformly distributed along the circumferential direction;
the number and the position distribution of the microphones and the video sensors are matched with each other.
Further, step 2 further comprises: and enhancing the audio signal in the sound source direction by using a self-adaptive beam forming method, and eliminating the interference sound in other directions.
Further, in step 2, the sound source direction of the live speaker is estimated in real time by using the super-resolution spectrum.
Further, step 2 further comprises: judging whether a live speaker exists or not; and when the speaker is not present, taking the panoramic video image obtained in the step 1 as a panoramic video output image.
Further, in step 3, the audio signal and the video signal are subjected to data compression and then uploaded to an upper computer through a network. .
2. A panoramic video recording device based on voice tracking is characterized by comprising:
a housing;
the microphone array is arranged on the shell and used for acquiring multi-path audio signals;
the multi-channel video sensor is arranged on the shell and used for acquiring multi-channel video signals; and
set up the audio frequency video processing apparatus in the casing, including video processing module, audio frequency processing module, video recombination module and output module, wherein:
the video processing module acquires video signals acquired by the multiple paths of video sensors, and performs panoramic fusion and splicing to obtain a panoramic video image;
the audio processing module acquires multi-channel audio signals acquired by the microphone array, calculates the sound source direction of a speaker in real time, enhances the voice signals in the sound source direction and eliminates interference sounds in other directions;
the video recombination module intercepts local images at corresponding positions from the panoramic video image according to the sound source direction output by the audio processing module, and integrates the panoramic video image and the intercepted local images to generate a new image;
and the output module outputs the audio data processed by the audio processing module and the image generated by the video recombination module. .
The invention has the following beneficial effects:
the invention collects multi-channel audio signals through a microphone array; acquiring a plurality of paths of video signals through a plurality of paths of video sensors, and fusing and splicing panoramic videos to form a panoramic video image; estimating the sound source direction of a live speaker in real time according to the audio signal; intercepting a live speaker close-up image at a corresponding position in the panoramic video image according to the direction of a sound source, and integrating the live speaker close-up image and the panoramic video image to form a panoramic video output image; and transmitting the audio signal and the video signal of the panoramic video output image to an upper computer through a network or directly outputting the audio signal and the video signal through monitoring equipment. The method has simple flow, can effectively realize automatic generation of the panoramic image and the close-up image, and has real-time property.
The upper computer performs face recognition on the close-up image of the site speaker and recognizes the identity of the speaker; the voice recognition method has the advantages that the voice signals are recognized, the voice is converted into characters to be stored, and the identity of a speaker is marked on the data, so that the field recording is more convenient.
The microphone array consists of a plurality of microphones, wherein 1 microphone is positioned at the position of a circle center, and the rest microphones are uniformly distributed along the circumferential direction; the multiple paths of video sensors are uniformly distributed along the circumferential direction; the number and the position distribution of the microphones and the video sensors are matched with each other. According to the invention, the microphone array and the video sensors are distributed, so that the accurate positioning of the sound source direction can be effectively ensured, and the accuracy of intercepting the close-up image is further ensured.
The invention utilizes the self-adaptive beam forming method to enhance the audio signal in the sound source direction and eliminate the interference sound in other directions; the super-resolution spectrum is used for estimating the sound source direction of the site speaker in real time, and the judgment accuracy of the sound source direction and the collected audio signal quality of the site speaker can be effectively guaranteed.
Drawings
FIG. 1 is a flow chart of a panoramic video recording method based on voice tracking according to the present invention;
FIG. 2 is a schematic block diagram of the circuit of the panoramic video recording apparatus based on voice tracking according to the present invention;
FIG. 3 is a functional block diagram of a panoramic video recording apparatus based on voice tracking according to the present invention;
FIG. 4 is a schematic distribution diagram of a microphone array of the present invention;
FIG. 5 is a schematic view of a panoramic video image output of the present invention;
FIG. 6 is an output schematic diagram of a panoramic video image and 1 close-up image of the present invention;
FIG. 7 is an output schematic diagram of a panoramic video image and 2 close-up images of the present invention;
fig. 8 is a schematic overall appearance of a panoramic video recording apparatus based on voice tracking according to the present invention;
fig. 9 is a schematic overall appearance diagram of a panoramic video recording device based on voice tracking according to the present invention.
Detailed Description
The present invention is described in detail with reference to the embodiments shown in the drawings, but it should be understood that these embodiments are not intended to limit the present invention, and those skilled in the art should understand that functional, methodological, or structural equivalents or substitutions made by these embodiments are within the scope of the present invention.
The first embodiment is as follows:
panoramic video recording method based on voice tracking
As shown in fig. 1, a panoramic video recording method based on voice tracking includes the following steps: the method comprises the following steps:
step 1: and acquiring a plurality of paths of audio signals and a plurality of paths of video signals, and fusing and splicing the plurality of paths of video signals through panoramic videos to form a panoramic video image.
As shown in fig. 4, 8 and 9, the microphone array 1 is composed of a plurality of microphones, wherein 1 microphone is located at the center of a circle, and the rest microphones are uniformly distributed along the circumferential direction; as shown in fig. 8 and 9, the multiple video sensors 3 are uniformly distributed along the circumferential direction; the number and the position distribution of the microphones and the video sensors are matched with each other. In the embodiment, 6 paths of video sensors are provided; the microphone array consists of 7 microphones.
Step 2: estimating the sound source direction of a live speaker in real time according to the audio signal; and intercepting a live speaker close-up image at a corresponding position in the panoramic video image according to the sound source direction, and integrating the live speaker close-up image and the panoramic video image to form a panoramic video output image.
And estimating the sound source direction of the speaker on site in real time by using the super-resolution spectrum. And enhancing the audio signal in the sound source direction by using a self-adaptive beam forming method, and eliminating the interference sound in other directions.
As shown in fig. 5 to 7, it is determined whether there is a live speaker; and when the speaker is judged to be not live (namely, when no live sound source is judged), taking the panoramic video image obtained in the step 1 as a panoramic video output image. And when the live speaker is judged, integrating the live speaker close-up image and the panoramic video image to form a panoramic video output image. The close-up images of the live speakers are 1, 2 or more. Where 101A, 101B, 101C close-up images for live speakers.
And step 3: and transmitting the audio signal and the video signal of the panoramic video output image to an upper computer through a network or directly outputting the audio signal and the video signal through monitoring equipment.
The upper computer carries out face recognition on the close-up image of the site speaker and identifies the identity of the speaker; and identifying the audio signal, converting the voice into characters, storing the data, and labeling the identity of the speaker to the data. After data compression is carried out on the audio signal and the video signal, the audio signal and the video signal are uploaded to an upper computer through a network; or, the video signal is output in a scaling mode according to the resolution of the monitoring equipment. In this embodiment, the audio signal and the video signal data are encoded according to the h.265 format, and are sent to the upper computer through the network interface according to the UDP protocol. The monitoring equipment is HDMI equipment, and if the video signal is not zoomed, the HDMI equipment directly plays the original panoramic video output image.
Example two:
panoramic video recording device based on voice tracking
As shown in fig. 8 and 9, the microphone array 1 is disposed on the top of the housing 4, and the microphone array 1 is used for acquiring multiple audio signals; multichannel video sensor 3 sets up in 4 sides of casing for gather multichannel video signal. The microphone array 1 consists of a plurality of microphones, wherein 1 microphone is positioned at the position of a circle center, and the rest microphones are uniformly distributed along the circumferential direction; the number and the position distribution of the microphones and the video sensors are matched with each other. In the embodiment, 6 paths of video sensors are provided; the microphone array consists of 7 microphones. Be equipped with lamp area 2 on casing 4, lamp area 2 encircles the setting along casing circumferencial direction, and 4 below of casing are equipped with A-frame 5 or configuration base 6, and the circuit is hidden in the leg tube, and is pleasing to the eye and protective nature is good.
The audio/video processing device (main board) is disposed in the housing and is configured to implement the method described in the first embodiment. As shown in fig. 2 and fig. 3, the audio/video processing device (motherboard) can be divided into four functional modules: the device comprises a video processing module, an audio processing module, a video recombination module and an output module. The video processing module acquires the multi-channel signals acquired by the 6 channels of video sensors, and performs 6-mesh panoramic fusion and splicing to obtain a panoramic video image. The audio processing module acquires 7 paths of audio signals acquired by the microphone array. The method comprises the steps of utilizing super-resolution spectrum estimation to calculate the sound source direction of a speaker in real time, utilizing self-adaptive beam forming to enhance voice signals in the speaker direction, and eliminating interference sounds in other directions. The video recombination module intercepts a local image (a live speaker close-up image) at a corresponding position from the panoramic video according to the sound source direction information output by the audio processing module, and generates a new image by the panoramic video image and the intercepted image according to the resolution of the external display equipment in the memory. The output module outputs the audio data processed by the audio processing module and the image obtained by the video recombination module, can output two paths, one path of the output module compresses and encodes the audio and video signals, has an output format of H.265 and uploads the audio and video signals to an upper computer through a network protocol; and the other path of the audio/video signal can directly output the audio/video signal through the HDMI interface through the monitoring equipment.
The above-listed detailed description is only a specific description of a possible embodiment of the present invention, and they are not intended to limit the scope of the present invention, and equivalent embodiments or modifications made without departing from the technical spirit of the present invention should be included in the scope of the present invention.
It will be evident to those skilled in the art that the invention is not limited to the details of the foregoing illustrative embodiments, and that the present invention may be embodied in other specific forms without departing from the spirit or essential attributes thereof. The present embodiments are therefore to be considered in all respects as illustrative and not restrictive, the scope of the invention being indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein.
Claims (10)
1. A panoramic video recording method based on voice tracking is characterized by comprising the following steps:
step 1: collecting a plurality of paths of audio signals and a plurality of paths of video signals, and fusing and splicing the plurality of paths of video signals through panoramic videos to form a panoramic video image;
step 2: estimating the sound source direction of a live speaker in real time according to the audio signal; intercepting a live speaker close-up image at a corresponding position in the panoramic video image according to the sound source direction, and integrating the live speaker close-up image and the panoramic video image to form a panoramic video output image;
and step 3: and uploading the audio signal and the panoramic video output image to an upper computer through a network or directly outputting the audio signal and the panoramic video output image through monitoring equipment.
2. The method for recording panoramic video based on voice tracking according to claim 1, wherein the step 3 further comprises: carrying out face recognition on the close-up image of the live speaker to recognize the identity of the speaker; and identifying the audio signal, converting the voice into characters, storing the data, and labeling the identity of the speaker to the data.
3. The method of claim 1, wherein the panoramic video recording method based on voice tracking comprises: the multi-channel audio signals are collected through a microphone array, and the multi-channel video signals are collected through a multi-channel video sensor.
4. The method of claim 3, wherein the panoramic video recording based on voice tracking is as follows: the microphone array consists of a plurality of microphones, wherein 1 microphone is positioned at the position of a circle center, and the rest microphones are uniformly distributed along the circumferential direction;
the multiple paths of video sensors are uniformly distributed along the circumferential direction;
the number and the position distribution of the microphones and the video sensors are matched with each other.
5. The method for recording panoramic video based on voice tracking according to claim 1, wherein the step 2 further comprises: and enhancing the audio signal in the sound source direction by using a self-adaptive beam forming method, and eliminating the interference sound in other directions.
6. The method of claim 1, wherein the panoramic video recording method based on voice tracking comprises: in step 2, the sound source direction of the site speaker is obtained by utilizing the super-resolution spectrum to estimate in real time.
7. The method for recording panoramic video based on voice tracking according to claim 1, wherein the step 2 further comprises: judging whether a live speaker exists or not; and when the speaker is not present, taking the panoramic video image obtained in the step 1 as a panoramic video output image.
8. The method of claim 1, wherein the panoramic video recording method based on voice tracking comprises: and 3, performing data compression on the audio signal and the video signal, and uploading the audio signal and the video signal to an upper computer through a network.
9. A panoramic video recording device based on voice tracking is characterized by comprising:
a housing;
the microphone array is arranged on the shell and used for acquiring multi-path audio signals;
the multi-channel video sensor is arranged on the shell and used for acquiring multi-channel video signals; and
set up the audio frequency video processing apparatus in the casing, including video processing module, audio frequency processing module, video recombination module and output module, wherein:
the video processing module acquires video signals acquired by the multiple paths of video sensors, and performs panoramic fusion and splicing to obtain a panoramic video image;
the audio processing module acquires multi-channel audio signals acquired by the microphone array, calculates the sound source direction of a speaker in real time, enhances the voice signals in the sound source direction and eliminates interference sounds in other directions;
the video recombination module intercepts local images at corresponding positions from the panoramic video image according to the sound source direction output by the audio processing module, and integrates the panoramic video image and the intercepted local images to generate a new image;
and the output module outputs the audio data processed by the audio processing module and the image generated by the video recombination module.
10. The panoramic video recording apparatus based on voice tracking as claimed in claim 9, wherein: the lamp belt is further arranged on the shell and arranged in a surrounding mode along the circumferential direction of the shell.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010021698.0A CN111163281A (en) | 2020-01-09 | 2020-01-09 | Panoramic video recording method and device based on voice tracking |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010021698.0A CN111163281A (en) | 2020-01-09 | 2020-01-09 | Panoramic video recording method and device based on voice tracking |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111163281A true CN111163281A (en) | 2020-05-15 |
Family
ID=70562024
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010021698.0A Pending CN111163281A (en) | 2020-01-09 | 2020-01-09 | Panoramic video recording method and device based on voice tracking |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111163281A (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111551921A (en) * | 2020-05-19 | 2020-08-18 | 北京中电慧声科技有限公司 | Sound source orientation system and method based on sound image linkage |
CN111918127A (en) * | 2020-07-02 | 2020-11-10 | 影石创新科技股份有限公司 | Video clipping method and device, computer readable storage medium and camera |
CN112073613A (en) * | 2020-09-10 | 2020-12-11 | 广州视源电子科技股份有限公司 | Conference portrait shooting method, interactive tablet, computer equipment and storage medium |
CN113676622A (en) * | 2020-05-15 | 2021-11-19 | 杭州海康威视数字技术股份有限公司 | Video processing method, image pickup apparatus, video conference system, and storage medium |
CN114666454A (en) * | 2020-12-23 | 2022-06-24 | 沈阳新松机器人自动化股份有限公司 | Intelligent conference system |
CN117037844A (en) * | 2023-10-10 | 2023-11-10 | 中国传媒大学 | Panoramic audio generation method and system based on panoramic video |
Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2003092731A (en) * | 2001-09-18 | 2003-03-28 | Mega Chips Corp | Image distribution method, image display recording method, and their programs |
CN102469295A (en) * | 2010-10-29 | 2012-05-23 | 华为终端有限公司 | Conference control method, related equipment and system |
CN104539896A (en) * | 2014-12-25 | 2015-04-22 | 桂林远望智能通信科技有限公司 | Intelligent panoramic monitoring and hotspot close-up monitoring system and method |
CN104767963A (en) * | 2015-03-27 | 2015-07-08 | 华为技术有限公司 | Method and device for representing information of persons participating in video conference |
CN105893948A (en) * | 2016-03-29 | 2016-08-24 | 乐视控股(北京)有限公司 | Method and apparatus for face identification in video conference |
CN107613243A (en) * | 2017-11-02 | 2018-01-19 | 深圳市裂石影音科技有限公司 | A kind of panoramic video recording arrangement and method for recording based on tone tracking |
CN107770484A (en) * | 2016-08-19 | 2018-03-06 | 杭州海康威视数字技术股份有限公司 | A kind of video monitoring information generation method, device and video camera |
CN109474797A (en) * | 2019-01-04 | 2019-03-15 | 北京快鱼电子股份公司 | Meeting re-recording system based on full-view camera and microphone array |
CN109788232A (en) * | 2018-12-18 | 2019-05-21 | 视联动力信息技术股份有限公司 | A kind of summary of meeting recording method of video conference, device and system |
CN110072075A (en) * | 2019-04-30 | 2019-07-30 | 平安科技(深圳)有限公司 | Conference management method, system and readable storage medium based on face recognition |
-
2020
- 2020-01-09 CN CN202010021698.0A patent/CN111163281A/en active Pending
Patent Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2003092731A (en) * | 2001-09-18 | 2003-03-28 | Mega Chips Corp | Image distribution method, image display recording method, and their programs |
CN102469295A (en) * | 2010-10-29 | 2012-05-23 | 华为终端有限公司 | Conference control method, related equipment and system |
CN104539896A (en) * | 2014-12-25 | 2015-04-22 | 桂林远望智能通信科技有限公司 | Intelligent panoramic monitoring and hotspot close-up monitoring system and method |
CN104767963A (en) * | 2015-03-27 | 2015-07-08 | 华为技术有限公司 | Method and device for representing information of persons participating in video conference |
CN105893948A (en) * | 2016-03-29 | 2016-08-24 | 乐视控股(北京)有限公司 | Method and apparatus for face identification in video conference |
CN107770484A (en) * | 2016-08-19 | 2018-03-06 | 杭州海康威视数字技术股份有限公司 | A kind of video monitoring information generation method, device and video camera |
CN107613243A (en) * | 2017-11-02 | 2018-01-19 | 深圳市裂石影音科技有限公司 | A kind of panoramic video recording arrangement and method for recording based on tone tracking |
CN109788232A (en) * | 2018-12-18 | 2019-05-21 | 视联动力信息技术股份有限公司 | A kind of summary of meeting recording method of video conference, device and system |
CN109474797A (en) * | 2019-01-04 | 2019-03-15 | 北京快鱼电子股份公司 | Meeting re-recording system based on full-view camera and microphone array |
CN110072075A (en) * | 2019-04-30 | 2019-07-30 | 平安科技(深圳)有限公司 | Conference management method, system and readable storage medium based on face recognition |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113676622A (en) * | 2020-05-15 | 2021-11-19 | 杭州海康威视数字技术股份有限公司 | Video processing method, image pickup apparatus, video conference system, and storage medium |
CN111551921A (en) * | 2020-05-19 | 2020-08-18 | 北京中电慧声科技有限公司 | Sound source orientation system and method based on sound image linkage |
CN111918127A (en) * | 2020-07-02 | 2020-11-10 | 影石创新科技股份有限公司 | Video clipping method and device, computer readable storage medium and camera |
CN112073613A (en) * | 2020-09-10 | 2020-12-11 | 广州视源电子科技股份有限公司 | Conference portrait shooting method, interactive tablet, computer equipment and storage medium |
CN112073613B (en) * | 2020-09-10 | 2021-11-23 | 广州视源电子科技股份有限公司 | Conference portrait shooting method, interactive tablet, computer equipment and storage medium |
CN114666454A (en) * | 2020-12-23 | 2022-06-24 | 沈阳新松机器人自动化股份有限公司 | Intelligent conference system |
CN117037844A (en) * | 2023-10-10 | 2023-11-10 | 中国传媒大学 | Panoramic audio generation method and system based on panoramic video |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111163281A (en) | Panoramic video recording method and device based on voice tracking | |
JP6984596B2 (en) | Audiovisual processing equipment and methods, as well as programs | |
EP2046032B1 (en) | A method and an apparatus for obtaining acoustic source location information and a multimedia communication system | |
CN112165590B (en) | Video recording implementation method and device and electronic equipment | |
CN1984310B (en) | Method and communication apparatus for reproducing a moving picture | |
CN104995681A (en) | Video analysis assisted generation of multi-channel audio data | |
KR20120053006A (en) | Improved audio/video methods and systems | |
JP2019220848A (en) | Data processing apparatus, data processing method and program | |
JP7428763B2 (en) | Information acquisition system | |
CN111656275B (en) | Method and device for determining image focusing area | |
US9756421B2 (en) | Audio refocusing methods and electronic devices utilizing the same | |
US11342001B2 (en) | Audio and video processing | |
WO2011108377A1 (en) | Coordinated operation apparatus, coordinated operation method, coordinated operation control program and apparatus coordination system | |
CN113707165B (en) | Audio processing method and device, electronic equipment and storage medium | |
JP6835205B2 (en) | Shooting sound pickup device, sound pick-up control system, shooting sound pick-up device control method, and shooting sound pick-up control system control method | |
Angkoso et al. | Penerapan Penataan Suara pada Produksi Acara Siaran Kethoprak Mataram di LPP RRI Stasiun Yogyakarta: Application of Sound Management in the Production of the Mataram Kethoprak Broadcast at LPP RRI Yogyakarta Station | |
US11665391B2 (en) | Signal processing device and signal processing system | |
CN214851543U (en) | Recording and broadcasting equipment | |
KR101747800B1 (en) | Apparatus for Generating of 3D Sound, and System for Generating of 3D Contents Using the Same | |
CN113824916A (en) | Image display method, device, equipment and storage medium | |
TW202228446A (en) | Sound source tracking system and method | |
JP7111202B2 (en) | SOUND COLLECTION CONTROL SYSTEM AND CONTROL METHOD OF SOUND COLLECTION CONTROL SYSTEM | |
CN117636928A (en) | Pickup device and related audio enhancement method | |
JPWO2019031004A1 (en) | Imaging system, imaging device, and imaging method | |
JP2017184154A (en) | Sound collection and reproduction device, sound collection and reproduction program, sound collection device and reproduction device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20200515 |