[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

WO2024055797A1 - 一种录像中抓拍图像的方法及电子设备 - Google Patents

一种录像中抓拍图像的方法及电子设备 Download PDF

Info

Publication number
WO2024055797A1
WO2024055797A1 PCT/CN2023/113138 CN2023113138W WO2024055797A1 WO 2024055797 A1 WO2024055797 A1 WO 2024055797A1 CN 2023113138 W CN2023113138 W CN 2023113138W WO 2024055797 A1 WO2024055797 A1 WO 2024055797A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
electronic device
images
frames
camera
Prior art date
Application number
PCT/CN2023/113138
Other languages
English (en)
French (fr)
Other versions
WO2024055797A9 (zh
Inventor
许集润
Original Assignee
荣耀终端有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 荣耀终端有限公司 filed Critical 荣耀终端有限公司
Priority to EP23864531.1A priority Critical patent/EP4436198A1/en
Publication of WO2024055797A1 publication Critical patent/WO2024055797A1/zh
Publication of WO2024055797A9 publication Critical patent/WO2024055797A9/zh

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/7243User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
    • H04M1/72439User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages for image or video messaging

Definitions

  • the present application relates to the field of photography technology, and in particular, to a method and electronic device for capturing images during video recording.
  • Existing mobile phones generally have camera and video functions, and more and more people use mobile phones to take photos and videos to record every detail of their lives.
  • video that is, video recording
  • some wonderful pictures may be collected.
  • the user may hope that the mobile phone can capture the above wonderful scenes and save them as photos to show to the user. Therefore, a solution that can capture images during video recording is urgently needed.
  • the mobile phone can intercept a frame of image collected at the moment of the user's capture in the video stream (such as a preview stream or a recording stream), and save it as a captured image as a photo to display to the user.
  • a large number of images (such as 30 frames of images) need to be processed per second.
  • mobile phones can generally use the ISP's hardware processing module to process the video stream using a relatively simple processing method; instead of using complex algorithms to improve Picture quality.
  • This kind of image processing effect can only meet the requirements of video; while taking pictures has higher requirements for image quality. Therefore, intercepting images from the video stream cannot capture images that satisfy the user.
  • This application provides a method and electronic device for capturing images during video recording, which can capture images during the video recording process and improve the image quality of the captured images.
  • the first aspect is to provide a method for capturing images in video recording, which method can be applied to electronic devices.
  • the electronic device may receive the user's first operation.
  • the first operation is used to trigger the electronic device to start recording video.
  • the camera of the electronic device collects the first image, and the electronic device displays the first interface.
  • the first interface is a viewfinder interface where the electronic device is recording a video.
  • the viewfinder interface displays a preview stream, and the preview stream includes a preview image obtained from the first image.
  • the first interface also includes a snapshot shutter.
  • the snap shutter is used to trigger an electronic device to snap an image to obtain a photo.
  • the electronic device can cache the first image collected by the camera in the first cache queue. Among them, the first cache queue caches n frames of first images collected by the camera, n ⁇ 1, and n is an integer.
  • the electronic device may select the second image from the n frames of first images cached in the first cache queue according to the additional information of the first image.
  • the additional information of the first image includes at least one of the contrast of the first image, the angular velocity when the camera collects the first image, and the timestamp of the first image.
  • the electronic device can perform first image processing on m frames of first images including the second image to obtain a captured image.
  • the first image processing includes a simple method of cropping the target preview image and cropping parameters. meal processing.
  • the target preview image is a frame of image collected by the camera when the electronic device receives the second operation in the preview stream. m ⁇ 1, m is an integer.
  • the above-mentioned first image processing has the function of improving image quality.
  • the electronic device can cache the Bayer image exposed by the image sensor (Sensor) in a first buffer queue (Buffer).
  • This Buffer can cache Bayer images.
  • the electronic device receives the user's snapping operation, the Bayer image output by the Sensor can also be cached in the first cache queue.
  • the image content of the frame output by the Sensor will not change much in a short period of time.
  • the frame selection module of the electronic device can select an image with better image quality from the Buffer as the captured image based on the additional information of the image buffered in the Buffer. In this way, the image quality of the captured image can be improved.
  • the electronic device can also perform first image processing on m frames of first images including the second image to obtain a captured image. Since the first image processing includes cropping processing according to the cropping method and cropping parameters of the target preview image, the electronic device can perform cropping processing on the m frames of the first image including the second image according to the cropping method and cropping parameters of the target preview image, The captured image can be obtained with the same field of view (FOV) as the target preview image, which can improve the quality of the captured image.
  • FOV field of view
  • images that meet user needs can be captured during the video recording process, and the image quality of the captured images can be improved.
  • the cropping method of the target preview image includes a center cropping method.
  • the cropping parameters of the target preview image include the center point coordinates of the cropping area of the target preview image and the cropped size information.
  • the electronic device may perform cropping processing on the m frames of the first image including the second image in a center cropping manner.
  • the center point coordinates of the cropping area of the target preview image and the cropped size information can be used to crop out the same size and the same FOV as the target preview image. captured images. In this way, the image quality of the captured image can be improved.
  • the electronic device performs first image processing on m frames of first images including the second image to obtain a captured image. Specifically, the electronic device performs image fusion on m frames of first images to obtain a third image. The electronic device performs cropping processing on the third image according to the cropping method and cropping parameters of the target preview image to obtain a fourth image. The electronic device performs second image processing on the fourth image to obtain a captured image.
  • the second image processing includes at least one of image noise reduction, brightness and acceptance correction, and image beautification processing.
  • the electronic device can improve the image processing efficiency and save the power consumption of the electronic device by merging multiple frames of the first image into one frame of image (ie, the third image) and only need to crop the one frame of image.
  • image noise reduction, brightness and acceptance correction can improve the image quality of the captured image
  • image beautification processing can improve the display effect of the captured image. Therefore, the electronic device performs second image processing on the fourth image, which can improve the image quality and display effect.
  • the method further includes: the electronic device obtains a logical identification of the target preview image.
  • the logical identifier of the target preview image is used to identify the camera that collects the target preview image.
  • the electronic device determines m frames of first images including the second image from n frames of first images according to the logical identification of the target preview image.
  • the logical identifier of the first image of the m frame is the same as the logical identifier of the target preview image.
  • one logical identifier corresponds to one camera.
  • the logical identifier of the first image of the m frame is the same as the logical identifier of the target preview image, indicating that the camera that collects the first image of the m frame is the same as the camera that collects the target preview image.
  • the head is the same camera.
  • the mobile phone can determine a first frame of n frames of first images collected by the same camera as the camera that collects the target preview image as a frame of m frames of first images. It is possible to conveniently and quickly determine m frames of first images from n frames of first images, thereby improving the efficiency of generating captured images.
  • one logical identifier can correspond to multiple cameras, or it can also be understood that one logical identifier corresponds to a camera set, and the camera set includes multiple cameras.
  • the logical identifier of the first image of the m frame is the same as the logical identifier of the target preview image, indicating that the camera that collects the first image of the m frame and the camera that collects the target preview image belong to the same camera set.
  • the mobile phone can determine a frame of first images collected by each of the multiple cameras included in the same camera set as one frame of m frames of first images. A large number of first images to be subjected to first image processing can be determined, thereby improving the image quality of the captured images.
  • the m frames of the first image are m consecutive m frames of images including the second image among the n frames of the first image.
  • m frames of first images include second images, and m-1 frames of first images among n frames of first images whose resolution is greater than the preset resolution threshold.
  • m frames of first images include second images, and m-1 frames of images in n frames of first images whose high dynamic range parameters meet the preset HDR conditions.
  • m consecutive m frames of images including the second image are m adjacent m frames of first images including the second image. The image content of adjacent images is more similar, so using m consecutive m-frame images to obtain the captured image is more conducive to improving the image quality of the captured image.
  • the resolution of the first image in frame m-1 is greater than the preset resolution threshold, which means that the resolution of the first image in frame m-1 is larger, which can also be understood to mean that the first image in frame m-1 has a higher definition. Therefore, using the first image with a larger resolution to obtain the captured image is more conducive to improving the image quality of the captured image.
  • the HDR parameters of the first image in frame m-1 meet the preset HDR conditions, indicating that the first image in frame m-1 has a high brightness dynamic range and richer colors. It can also be understood as the painting of the first image in frame m-1. Higher quality. Therefore, using the first image with higher image quality to obtain the captured image is more conducive to improving the image quality of the captured image.
  • the additional information of the first image includes the contrast of the first image, and the contrast of the first image is used to characterize the clarity of the first image.
  • the second image is: the first image with the highest contrast among the n frames of first images cached in the first cache queue.
  • the electronic device can select the image with the highest definition from n frames of first images as the second image (ie, the alternative captured image). In this way, it is helpful to improve the image quality of the captured image.
  • the additional information of the first image includes an angular velocity when the camera collects the first image, and the angular velocity is used to characterize the jitter situation when the camera collects the first image.
  • the above-mentioned second image is: the first image with the smallest angular velocity among the n frames of first images cached in the first buffer queue.
  • the electronic device can select the image with the smallest jitter from the n frames of first images as the second image (i.e., the alternative captured image). This is conducive to improving the image quality of the captured image.
  • the additional information of the first image also includes a timestamp of the first image; each frame of the first image includes a timestamp, and the timestamp is recorded on the image sensor of the electronic device. lose Output the time corresponding to the first image.
  • the clock of the upper-layer application in the electronic device is synchronized with the clock of the image sensor that records the first image and renders the image; or, the clock of the upper-layer application of the electronic device and the clock of the image sensor that records the first image and renders the image are the same system clock.
  • the second image is: among the n frames of first images cached in the first cache queue, the first image whose time stamp is recorded is closest to the time when the electronic device receives the second operation.
  • the electronic device uses the first image whose time stamp is recorded closest to the time when the electronic device receives the second operation as the second image, which is beneficial to capturing the image content that the user really wants.
  • only one frame of image i.e., the second image
  • the first image processing which can reduce the operating load of the mobile phone and save the power consumption of the mobile phone. If the current temperature of the mobile phone is less than or equal to the preset temperature threshold, it means that the current temperature of the mobile phone is not very high. At this time, the mobile phone can obtain the captured image based on the first image of multiple frames, which can improve the image quality of the captured image.
  • this application provides an electronic device, which includes a touch screen, a memory, a camera, a display screen, and one or more processors.
  • the touch screen, memory, camera, display screen and processor are coupled.
  • computer program code is stored in the memory, and the computer program code includes computer instructions.
  • the electronic device causes the electronic device to perform the method described in the first aspect and any possible design manner thereof.
  • this application provides an electronic device, which includes a touch screen, a memory, a camera, and one or more processors.
  • the touch screen, memory, camera and processor are coupled.
  • computer program code is stored in the memory, and the computer program code includes computer instructions.
  • the electronic device performs the following steps: receiving a first operation from the user; wherein the first operation is used to trigger the electronic device.
  • the camera collects the first image
  • the display screen displays the first interface
  • the first interface is a viewfinder interface in which the electronic device is recording video
  • the viewfinder interface displays a preview stream
  • the preview stream includes the first
  • the first interface also includes a capture shutter, which is used to trigger the electronic device to capture the image and obtain the photo
  • the first cache queue caches the first image collected by the camera
  • the first cache queue caches the n collected by the camera.
  • Frame the first image, n ⁇ 1, n is an integer; in response to the user's second operation on the snap shutter, select the second image from the n frames of the first image cached in the first cache queue based on the additional information of the first image ;
  • the additional information of the first image includes at least one of the contrast of the first image, the angular velocity when the camera collects the first image, and the timestamp of the first image; perform the first step on the m frames of the first image including the second image.
  • Image processing is performed to obtain a captured image; wherein, the first image processing includes cropping processing according to the cropping method and cropping parameters of the target preview image, and the target preview image is an image collected by the camera when the electronic device receives the second operation in the preview stream.
  • Frame image m ⁇ 1, m is an integer; the first image processing has the function of improving image quality.
  • the cropping method of the above target preview image includes:
  • the cropping parameters of the target preview image include the coordinates of the center point of the cropping area of the target preview image and the cropped size information.
  • the electronic device When the computer instructions are executed by the processor, the electronic device also performs the following steps: perform image fusion on m frames of the first image to obtain a third image; perform cropping processing on the third image according to the cropping method and cropping parameters of the target preview image. , obtain a fourth image; perform second image processing on the fourth image to obtain a captured image; wherein the second image processing includes: at least one of image noise reduction, brightness and acceptance correction, and image beautification processing.
  • the electronic device when the computer instructions are executed by the processor, can also perform the following steps: obtain the logical identification of the target preview image, and the logical identification of the target preview image is used for identification collection The camera of the target preview image; according to the logical identification of the target preview image, determine the m frames of first images including the second image from the n frames of first images; wherein the logical identification of the m frames of the first image is the same as the target The preview images have the same logical identity.
  • the m frames of the first image are m consecutive m frames of images including the second image in the n frames of the first image; or, the m frames of the first image include the second image, and n m-1 frames of the first image in the frame of the first image, the resolution of which is greater than the preset resolution threshold; or, m frames of the first image include the second image, and n frames of the first image, the HDR parameters satisfy the preset HDR condition The first image of m-1 frames.
  • the additional information of the first image includes the contrast of the first image, and the contrast of the first image is used to characterize the clarity of the first image.
  • the second image is: the first image with the highest contrast among the n frames of first images cached in the first cache queue.
  • the additional information of the first image includes the angular velocity when the camera collects the first image, and the angular velocity is used to characterize the jitter situation when the camera collects the first image.
  • the second image is: the first image with the smallest angular velocity among the n frames of first images cached in the first buffer queue.
  • the additional information of the first image also includes a timestamp of the first image; each frame of the first image includes a timestamp, and the timestamp is recorded by the image sensor of the electronic device. Output the time corresponding to the first image.
  • the clock of the upper-layer application in the electronic device is synchronized with the clock of the image sensor that records the first image and renders the image; or, the clock of the upper-layer application of the electronic device and the clock of the image sensor that records the first image and renders the image are the same system clock.
  • the second image is: among the n frames of first images cached in the first cache queue, the first image whose time stamp is recorded is closest to the time when the electronic device receives the second operation.
  • the present application provides a computer-readable storage medium.
  • the computer-readable storage medium includes computer instructions.
  • the electronic device causes the electronic device to execute the first aspect and any possible method thereof. The method described in the design method.
  • this application provides a computer program product.
  • the computer program product When the computer program product is run on a computer, it causes the computer to execute the method described in the first aspect and any possible design manner.
  • the computer It can be the electronic equipment mentioned above.
  • Figure 1 is the processing flow chart of the Sensor output image, ISP and ENCODE processing the image to obtain a high-definition image
  • Figure 2 is a schematic diagram of a video viewing interface of a mobile phone provided by an embodiment of the present application
  • Figure 3 is a schematic diagram of the delay time between a mobile phone receiving a snapshot operation and a Sensor receiving a snapshot instruction provided in an embodiment of the present application;
  • Figure 4A is a principle block diagram of a method for a mobile phone to intercept a frame of image from a video stream as a captured image provided by an embodiment of the present application;
  • Figure 4B is a schematic block diagram of a method for a mobile phone to use the captured image obtained by the user during the shooting process as a captured image provided by an embodiment of the present application;
  • Figure 4C is a schematic block diagram of a method for capturing images during video recording provided by an embodiment of the present application.
  • Figure 4D is a schematic block diagram of another method for capturing images during video recording provided by an embodiment of the present application.
  • Figure 5A is a schematic structural diagram of an electronic device 500 provided by an embodiment of the present application.
  • Figure 5B is a schematic diagram of the software architecture of a mobile phone provided by an embodiment of the present application.
  • Figure 6 is a flow chart of a method for capturing images during video recording provided by an embodiment of the present application.
  • Figure 7 is a schematic diagram of a display interface of a mobile phone provided by an embodiment of the present application.
  • Figure 8 is a schematic diagram of a first cache queue provided by an embodiment of the present application.
  • Figure 9 is a schematic diagram of a display interface of another mobile phone provided by an embodiment of the present application.
  • Figure 10 is a schematic diagram of a display interface of another mobile phone provided by an embodiment of the present application.
  • Figure 11 is a flow chart of another method for capturing images during video recording provided by an embodiment of the present application.
  • Figure 12 is a schematic block diagram of another method for capturing images during video recording provided by an embodiment of the present application.
  • Figure 13 is a flow chart of another method for capturing images during video recording provided by an embodiment of the present application.
  • Figure 14 is a schematic structural diagram of a chip system provided by an embodiment of the present application.
  • the image sensor of the mobile phone is controlled by the exposure and can continuously output Bayer images.
  • Each frame of Bayer image is processed by the mobile phone's image signal processor (image signal processor, ISP), and then encoded by the encoder (ENCODER) to obtain a video stream (such as a preview stream or recording stream).
  • image signal processor image signal processor
  • ENCODER encoder
  • Figure 1 shows the processing flow of the preview stream and recording stream in the mobile phone after the image sensor (Sensor) outputs the image during the mobile phone recording process.
  • the preview stream refers to the video stream presented to the user on the display screen during the recording process of the mobile phone
  • the recording stream refers to the video stream saved in the mobile phone for the user to view after the recording is completed.
  • the image can be processed by the ISP; after the image processing by the ISP, it can be divided into two data streams.
  • One data stream is processed using processing algorithm 1 shown in Figure 1, and then encoded by encoder 1 to obtain a preview stream.
  • the other data stream is processed using the processing algorithm 2 shown in Figure 1, and then can be encoded by the encoder 2 to obtain the video stream.
  • processing algorithm 1 can also be called the post-processing algorithm of the preview stream, and the processing algorithm 2 can also be called the post-processing algorithm of the video stream.
  • Processing algorithm 1 and processing algorithm 2 may include anti-shake processing, denoising processing, blur processing, color and brightness adjustment and other processing functions.
  • the way the mobile phone handles the preview stream and the recording stream during the recording process includes but is not limited to the way shown in Figure 1.
  • the ISP can perform part of the image processing on the image (such as "RAW domain” and "RGB domain” image processing). Then, it can be divided into two data streams; one data stream is processed by processing algorithm 1, and then the other part of the image processing is performed by the ISP (such as "YUV domain” image processing), and then the preview stream can be encoded by encoder 1.
  • the other data stream is processed using processing algorithm 2, and then the ISP performs another part of the image processing (such as "YUV domain” image processing), and then passes through the encoder 2 to encode the video stream.
  • the processing method of the preview stream and the recording stream shown in Figure 1 is taken as an example to introduce the method of the embodiment of the present application.
  • the Sensor outputs images, the ISP and the encoder i.e. ENCODER, such as encoder 1 and encoder 2
  • the Sensor output images, ISP and the encoder can process the images
  • the data stream in the entire process (such as preview stream and recording stream) is called video stream.
  • the mobile phone can capture images in response to the user's operations.
  • the mobile phone can display the video framing interface 201 shown in FIG. 2 .
  • the video framing interface 201 includes a capture shutter 202, which is used to trigger the mobile phone to capture images during the video recording and save them as photos.
  • the mobile phone can capture images in response to the user's click operation on the capture shutter 202 shown in FIG. 2 .
  • what the user wants the mobile phone to capture is the image collected by the camera at the moment when the user clicks the capture shutter 202 .
  • the first frame image collected when the Snapshot program of the mobile phone receives the capture instruction can be selected as the capture image (the 7th frame image as shown in Figure 3).
  • the upper-layer application the camera application corresponding to the viewfinder interface 201 of the video recording as shown in Figure 2
  • receives the user's snapshot operation such as the user's click operation on the snapshot shutter 202
  • it takes time to transmit the snapshot instruction to the Snapshot program such as The delay length shown in Figure 3.
  • the Sensor will not stop outputting Bayer images. Therefore, from the time the upper-layer application receives the user's snapshot operation to the time the Snapshot program receives the snapshot instruction, the Sensor may have output multiple frames of Bayer images.
  • the image sensor (Sensor) outputs the Bayer image of the third frame
  • the upper layer The application receives the snapshot operation; when the Sensor outputs the 7th frame Bayer image, the snapshot instruction is passed to the Snapshot program.
  • the seventh frame image is not the frame image at the moment when the user clicks the capture shutter 202.
  • the first frame of image is the earliest frame of the Sensor
  • the 8th frame of image is the latest frame of the Sensor.
  • the image sensor (Sensor) can start from the first frame of image and sequentially expose and output 8 frames of images as shown in Figure 3.
  • the mobile phone can intercept a frame of image captured at the moment of the user's capture in a video stream (such as a preview stream or a video stream), and save it as a captured image as a photo to display to the user.
  • a video stream such as a preview stream or a video stream
  • each frame of Bayer image output by the mobile phone's image sensor passes through the mobile phone's image front-end engine, anti-shake module, image processing engine, and color processing module to obtain a preview stream.
  • the image front-end engine can be the ISP in Figure 1, Figure 4C, and Figure 4D.
  • the image front-end engine is used to perform part of the image processing (such as "RAW domain” and "RGB domain” image processing) on each frame of Bayer image. ).
  • the anti-shake module, image processing engine, and color processing module may be functional modules corresponding to the image processing performed by the processing algorithm 1 in FIG. 1, FIG. 4C, and FIG. 4D.
  • the anti-shake module is a functional module corresponding to anti-shake processing, which is used to perform anti-shake processing on each frame of Bayer image.
  • the image processing engine can be a functional module corresponding to denoising processing, used to denoise each frame of Bayer image;
  • the color processing module can be a functional module corresponding to color and brightness adjustment, used to perform color and brightness adjustment on each frame of Bayer image. Brightness adjustment and other processing.
  • the mobile phone can intercept a frame of the image captured at the moment of the user's capture in the preview stream, and save it as a captured image as a photo to display to the user.
  • the mobile phone can also save the captured images obtained by the user during the shooting process as captured images into photos and display them to the user.
  • the captured images obtained by the user during the recording process are based on the shooting function included in the mobile phone (specifically, the upper-layer application). They are not images captured by the user during the recording process, and cannot effectively show every aspect of the user's recording process. One detail cannot capture an image that satisfies the user.
  • each frame of Bayer image output by the image sensor (Sensor) of the mobile phone passes through the image processing module of the mobile phone to generate a captured image.
  • the mobile phone can save the captured image as a snapshot image and display it to the user as a photo.
  • the image processing module can include anti-shake processing, denoising processing, blur processing, color and brightness adjustment and other processing functions. That is, the image processing module is used to perform anti-shake processing, denoising processing, etc. on each frame of Bayer image. Blur processing, color and brightness adjustment, etc.
  • Embodiments of the present application provide a method for capturing images during video recording, which can capture images during the video recording process and improve the image quality of the captured images.
  • the electronic device can cache the first image (ie, Bayer image) exposed by the Sensor in a first buffer queue (Buffer).
  • This Buffer can cache multiple frames of the first image (i.e. Bayer image).
  • the Bayer image output by the Sensor can also be cached in the first cache queue.
  • the frame selection module of the mobile phone can select an image with a higher quality from the Buffer.
  • a good frame of image i.e. the second image
  • the image quality of the captured image can be improved.
  • the electronic device can also perform first image processing on m frames of first images including the second image to obtain a captured image. Since the first image processing includes cropping processing according to the cropping method and cropping parameters of the target preview image; therefore, the electronic device can perform cropping processing on the m-frame first image including the second image according to the cropping method and cropping parameters of the target preview image, A captured image with the same field of view FOV as the target preview image can be obtained, which can improve the image quality of the captured image.
  • images that meet user needs can be captured during the video recording process, and the image quality of the captured images can be improved.
  • the above-mentioned first image processing may also include image processing performed by a preset RAW domain AI image enhancement algorithm model (referred to as a preset RAW domain image processing algorithm).
  • the electronic device can use a preset RAW domain image processing algorithm to process the snapshot frame selected by the frame selection module; finally, the encoder 3 encodes the processing result to obtain a snapshot stream.
  • the preset RAW domain image processing algorithm is a RAW domain image quality enhanced deep learning network.
  • the preset RAW domain image processing algorithm can be used to improve the image quality of the captured frames. That is, by using the method of the embodiment of the present application, images that meet user needs can be captured during the video recording process, and the image quality of the captured images can be improved.
  • the preset RAW domain image processing algorithm is a RAW domain image quality enhanced deep learning network.
  • the preset RAW domain image processing algorithm may also be called a preset image quality enhancement algorithm, a preset image quality enhancement algorithm model or a preset RAW domain AI model.
  • the preset RAW domain image processing algorithm may be a software image processing algorithm.
  • the preset RAW domain image processing algorithm may be a software algorithm in the mobile phone's hardware abstraction layer (HAL) algorithm library.
  • the preset RAW domain image processing algorithm may be a hardware image processing algorithm.
  • the preset RAW domain image processing algorithm may be a hardware image processing algorithm implemented by calling the image processing algorithm capability of the ISP.
  • the preset RAW domain image processing algorithm can also be called a preset image processing algorithm.
  • the reason why the embodiment of the present application is called a preset RAW domain image processing algorithm is because the input of the preset RAW domain image processing algorithm is an image in the RAW domain.
  • the output of the preset RAW domain image processing algorithm may be an image in the RAW domain or an image in the RGB domain, which is not limited in the embodiment of the present application.
  • the above-mentioned encoder 1, encoder 2 and encoder 3 may be three different encoders.
  • the mobile phone can use three different encoders to encode the above preview stream, video stream and capture stream respectively.
  • the above-mentioned encoder 1, encoder 2 and encoder 3 may be the same encoder.
  • An encoder can include multiple encoding units.
  • the mobile phone can use three different encoding units in one encoder to encode the preview stream, video stream and snapshot stream respectively.
  • encoder 1 and encoder 2 may be two different encoding units in the same encoder
  • encoder 3 may be another encoder.
  • the encoding methods of different encoders can be the same or different.
  • the encoding methods of different coding units of the same encoder can be the same or different. Therefore, the image formats output by the encoder in the display module and the encoder 1 may be the same or different.
  • the image output by the encoder and encoder 1 in the display module can be an image in any format such as Joint Photographic Experts Group (JPEG), Tag Image File Format (TIFF), etc. .
  • JPEG Joint Photographic Experts Group
  • TIFF Tag Image File Format
  • the image output by the image sensor (Sensor) shown in Figure 1, Figure 4A, Figure 4B, Figure 4C or Figure 4D is a Bayer format image (Bayer image for short).
  • Bayer, JPEG and TIFF are the three expression formats of images.
  • JPEG images please refer to the relevant content in general technology and will not be repeated here.
  • the electronic device in the embodiment of the present application may be a mobile phone, a tablet computer, a smart watch, a desktop, a laptop, a handheld computer, a notebook computer, an ultra-mobile personal computer (UMPC), or a netbook.
  • devices including cameras such as cellular phones, personal digital assistants (PDAs), augmented reality (AR), virtual reality (VR) devices, etc.
  • PDAs personal digital assistants
  • AR augmented reality
  • VR virtual reality
  • the embodiments of the present application are specific to the electronic devices. There are no special restrictions on the form.
  • FIG. 5A is a schematic structural diagram of an electronic device 500 provided by an embodiment of the present application.
  • the electronic device 500 may include: a processor 510, an external memory interface 520, an internal memory 521, a universal serial bus (USB) interface 530, a charging management module 540, a power management module 541, and a battery.
  • antenna 1 antenna 2
  • mobile communication module 550 wireless communication module 560
  • audio module 570 speaker 570A
  • receiver 570B wireless communication module 560
  • audio module 570 speaker 570A
  • receiver 570B microphone 570C
  • headphone interface 570D sensor module 580
  • button 590 motor 591, indicator 592, camera 593, display screen 594, and subscriber identification module (subscriber identification module, SIM) card interface 595, etc.
  • subscriber identification module subscriber identification module, SIM
  • the sensor module 580 may include a pressure sensor, a gyroscope sensor, an air pressure sensor, a magnetic sensor, an acceleration sensor, a distance sensor, a proximity light sensor, a fingerprint sensor, a temperature sensor, a touch sensor, an ambient light sensor, a bone conduction sensor, and other sensors.
  • the structure illustrated in this embodiment does not constitute a specific limitation on the electronic device 500 .
  • the electronic device 500 may include more or fewer components than illustrated, some components may be combined, some components may be separated, or components may be arranged differently.
  • the components illustrated may be implemented in hardware, software, or a combination of software and hardware.
  • the processor 510 may include one or more processing units.
  • the processor 510 may include an application processor (application processor, AP), a modem processor, a GPU, an image signal processor (image signal processor, ISP), a control unit. processor, memory, video codec, digital signal processor (DSP), baseband processor, and/or NPU, etc.
  • application processor application processor, AP
  • modem processor modem processor
  • GPU graphics processing unit
  • image signal processor image signal processor
  • ISP image signal processor
  • control unit processor
  • memory video codec
  • DSP digital signal processor
  • baseband processor baseband processor
  • NPU baseband processor
  • the controller may be the nerve center and command center of the electronic device 500 .
  • the controller can generate operation control signals based on the instruction operation code and timing signals to complete the control of fetching and executing instructions.
  • the processor 510 may also be provided with a memory for storing instructions and data.
  • the memory in processor 510 is cache memory. This memory may hold instructions or data that have been recently used or recycled by processor 510 . If the processor 510 needs to use the instructions or data again, it can be retrieved directly from the memory. Accept the call. Repeated access is avoided and the waiting time of the processor 510 is reduced, thus improving the efficiency of the system.
  • processor 510 may include one or more interfaces. It can be understood that the interface connection relationships between the modules illustrated in this embodiment are only schematic illustrations and do not constitute a structural limitation of the electronic device 500 . In other embodiments, the electronic device 500 may also adopt different interface connection methods in the above embodiments, or a combination of multiple interface connection methods.
  • the charge management module 540 is used to receive charging input from the charger. While the charging management module 540 charges the battery 542, it can also provide power to the electronic device through the power management module 541.
  • the power management module 541 is used to connect the battery 542, the charging management module 540 and the processor 510.
  • the power management module 541 receives input from the battery 542 and/or the charging management module 540 and supplies power to the processor 510, internal memory 521, external memory, display screen 594, camera 593, and wireless communication module 560.
  • the wireless communication function of the electronic device 500 can be implemented through the antenna 1, the antenna 2, the mobile communication module 550, the wireless communication module 560, the modem processor and the baseband processor.
  • Antenna 1 and Antenna 2 are used to transmit and receive electromagnetic wave signals.
  • the antenna 1 of the electronic device 500 is coupled to the mobile communication module 550, and the antenna 2 is coupled to the wireless communication module 560, so that the electronic device 500 can communicate with the network and other devices through wireless communication technology.
  • the electronic device 500 implements display functions through a GPU, a display screen 594, an application processor, and the like.
  • the GPU is an image processing microprocessor and is connected to the display screen 594 and the application processor. GPUs are used to perform mathematical and geometric calculations for graphics rendering.
  • Processor 510 may include one or more GPUs that execute program instructions to generate or alter display information.
  • the display screen 594 is used to display images, videos, etc.
  • the display 594 includes a display panel.
  • the display panel can use a liquid crystal display (LCD), an organic light-emitting diode (OLED), an active matrix organic light emitting diode or an active matrix organic light emitting diode (active-matrix organic light emitting diode).
  • LCD liquid crystal display
  • OLED organic light-emitting diode
  • AMOLED organic light-emitting diode
  • FLED flexible light-emitting diode
  • Miniled MicroLed, Micro-oLed, quantum dot light emitting diode (QLED), etc.
  • the electronic device 500 can implement the shooting function through an ISP, a camera 593, a video codec, a GPU, a display screen 594, and an application processor.
  • the ISP is used to process the data fed back by the camera 593. For example, when taking a photo, the shutter is opened, the light is transmitted to the camera sensor through the lens, the optical signal is converted into an electrical signal, and the camera sensor passes the electrical signal to the ISP for processing, and converts it into an image visible to the naked eye. ISP can also perform algorithm optimization on image noise, brightness, and skin color. ISP can also optimize the exposure, color temperature and other parameters of the shooting scene. In some embodiments, the ISP may be provided in the camera 593.
  • Camera 593 is used to capture still images or video.
  • the object passes through the lens to produce an optical image that is projected onto the photosensitive element.
  • the photosensitive element may be a charge coupled device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor.
  • CMOS complementary metal-oxide-semiconductor
  • the photosensitive element converts the optical signal into an electrical signal, and then passes the electrical signal to the ISP to convert it into a digital image signal.
  • ISP outputs digital image signals to DSP for processing.
  • DSP converts digital image signals into standard RGB, YUV and other format image signals.
  • the electronic device 500 may include N cameras 593, where N is a positive integer greater than 1.
  • Digital signal processors are used to process digital signals. In addition to digital image signals, they can also process other digital signals. For example, when the electronic device 500 selects a frequency point, the digital signal processor is used to perform Fourier transform on the frequency point energy.
  • Video codecs are used to compress or decompress digital video.
  • Electronic device 500 may support one or more video codecs. In this way, the electronic device 500 can play or record videos in multiple encoding formats, such as: moving picture experts group (MPEG) 1, MPEG2, MPEG3, MPEG4, etc.
  • MPEG moving picture experts group
  • MPEG2 MPEG2, MPEG3, MPEG4, etc.
  • NPU is a neural network (NN) computing processor.
  • NN neural network
  • Intelligent cognitive applications of the electronic device 500 can be implemented through the NPU, such as image recognition, face recognition, speech recognition, text understanding, etc.
  • the external memory interface 520 can be used to connect an external memory card, such as a Micro SD card, to expand the storage capacity of the electronic device 500.
  • the external memory card communicates with the processor 510 through the external memory interface 520 to implement the data storage function. Such as saving music, videos, etc. files in external memory card.
  • Internal memory 521 may be used to store computer executable program code, which includes instructions.
  • the processor 510 executes instructions stored in the internal memory 521 to execute various functional applications and data processing of the electronic device 500 .
  • the processor 510 can execute instructions stored in the internal memory 521, and the internal memory 521 can include a program storage area and a data storage area.
  • the stored program area can store an operating system, at least one application program required for a function (such as a sound playback function, an image playback function, etc.).
  • the storage data area may store data created during use of the electronic device 500 (such as audio data, phone book, etc.).
  • the internal memory 521 may include high-speed random access memory, and may also include non-volatile memory, such as at least one disk storage device, flash memory device, universal flash storage (UFS), etc.
  • the electronic device 500 can implement audio functions through the audio module 570, the speaker 570A, the receiver 570B, the microphone 570C, the headphone interface 570D, and the application processor. Such as music playback, recording, etc.
  • the buttons 590 include a power button, a volume button, etc.
  • Motor 591 can produce vibration prompts.
  • the indicator 592 may be an indicator light, which may be used to indicate charging status, power changes, or may be used to indicate messages, missed calls, notifications, etc.
  • the SIM card interface 595 is used to connect a SIM card.
  • the SIM card can be connected to or separated from the electronic device 500 by inserting it into the SIM card interface 595 or pulling it out from the SIM card interface 595 .
  • the electronic device 500 can support 1 or N SIM card interfaces, where N is a positive integer greater than 1.
  • SIM card interface 595 can support Nano SIM card, Micro SIM card, SIM card, etc.
  • the electronic device 500 is a mobile phone as an example to introduce the method of the embodiment of the present application.
  • Figure 5B is a software structure block diagram of the mobile phone according to the embodiment of the present application.
  • the layered architecture divides the software into several layers, and each layer has clear roles and division of labor.
  • the layers communicate through software interfaces.
  • the Android TM system is divided into five layers, from top to bottom: application layer, application framework layer, Android runtime (Android runtime) and system library, hardware abstraction layer (HAL) ) and the kernel layer.
  • HAL hardware abstraction layer
  • the application layer can include a series of application packages.
  • applications such as calls, games, cameras, navigation, browsers, calendars, maps, Bluetooth, music, and videos can be installed in the application layer.
  • an application with a shooting function such as a camera application
  • a shooting function such as a camera application
  • they can also call the camera application to implement the shooting function.
  • the application framework layer provides an application programming interface (API) and programming framework for applications in the application layer.
  • API application programming interface
  • the application framework layer includes some predefined functions.
  • the application framework layer may include a window manager, a content provider, a view system, a resource manager, a notification manager, etc.
  • This embodiment of the present application does not impose any limitation on this.
  • the window manager mentioned above is used to manage window programs.
  • the window manager can obtain the display size, determine whether there is a status bar, lock the screen, capture the screen, etc.
  • the above content providers are used to store and retrieve data and make this data accessible to applications. Said data can include videos, images, audio, calls made and received, browsing history and bookmarks, phone books, etc.
  • the above view system can be used to build the display interface of the application.
  • Each display interface can be composed of one or more controls.
  • controls can include interface elements such as icons, buttons, menus, tabs, text boxes, dialog boxes, status bars, navigation bars, and widgets.
  • the above resource manager provides various resources to applications, such as localized strings, icons, pictures, layout files, video files, etc.
  • the above notification manager allows the application to display notification information in the status bar, which can be used to convey notification type messages, and can automatically disappear after a short stay without user interaction.
  • the notification manager is used to notify download completion, message reminders, etc.
  • the notification manager can also be notifications that appear in the status bar at the top of the system in the form of charts or scroll bar text, such as notifications for applications running in the background, or notifications that appear on the screen in the form of conversation windows. For example, prompt text information in the status bar, sound a beep, vibrate, blink the indicator light, etc.
  • the Android runtime includes core libraries and virtual machines. Android runtime is responsible for the scheduling and management of the Android system.
  • the core library contains two parts: one is the functional functions that need to be called by the Java language, and the other is the core library of Android.
  • the application layer and application framework layer run in virtual machines.
  • the virtual machine executes the java files of the application layer and application framework layer into binary files.
  • the virtual machine is used to perform object life cycle management, stack management, thread management, security and exception management, and garbage collection and other functions.
  • System libraries can include multiple functional modules. For example: surface manager (surface manager), media libraries (Media Libraries), 3D graphics processing libraries (for example: OpenGL ES), 2D graphics engines (for example: SGL), etc.
  • the surface manager is used to manage the display subsystem and provides the integration of 2D and 3D layers for multiple applications.
  • the media library supports playback and recording of a variety of commonly used audio and video formats, as well as static image files, etc.
  • the media library can support a variety of audio and video encoding formats, such as: MPEG4, H.254, MP3, AAC, AMR, JPG, PNG, etc.
  • the 3D graphics processing library is used to implement 3D graphics drawing, image rendering, composition, and layer processing.
  • 2D Graphics Engine is a drawing engine for 2D drawing.
  • the kernel layer is located below the HAL and is the layer between hardware and software.
  • the kernel layer at least includes a display driver, a camera driver, an audio driver, a sensor driver, etc. This embodiment of the present application does not impose any restrictions on this.
  • a camera service can be set in the application framework layer.
  • the camera application can start the Camera Service by calling the preset API.
  • the Camera Service can interact with the Camera HAL in the Hardware Abstraction Layer (HAL) during operation.
  • Camera HAL is responsible for interacting with the hardware devices (such as cameras) that implement the shooting function in the mobile phone.
  • Camera HAL hides the implementation details of the relevant hardware devices (such as specific image processing algorithms), and on the other hand, it can provide the Android system with Call the interface of related hardware devices.
  • the relevant control commands issued by the user can be sent to the Camera Service.
  • the Camera Service can send the received control command to the Camera HAL, so that the Camera HAL can call the camera driver in the kernel layer according to the received control command, and the camera driver drives the camera and other hardware devices to respond to the control command to collect images. data.
  • the camera can pass each frame of image data collected to the Camera HAL through the camera driver according to a certain frame rate.
  • the transfer process of control commands within the operating system can be seen in the specific transfer process of the control flow in Figure 5B.
  • the Camera Service can determine the shooting strategy at this time based on the received control command.
  • the shooting strategy sets specific image processing tasks that need to be performed on the collected image data. For example, in preview mode, Camera Service can set image processing task 1 in the shooting strategy to implement the face detection function. For another example, if the user turns on the beautification function in preview mode, the Camera Service can also set image processing task 2 in the shooting policy to implement the beautification function. Furthermore, the Camera Service can send the determined shooting strategy to the Camera HAL.
  • Camera HAL When Camera HAL receives each frame of image data collected by the camera, it can perform corresponding image processing tasks on the above image data according to the shooting strategy issued by Camera Service, and obtain each frame of shooting screen after image processing. For example, Camera HAL can perform image processing task 1 on each frame of image data received according to shooting strategy 1 to obtain the corresponding shooting picture of each frame. When shooting strategy 1 is updated to shooting strategy 2, Camera HAL can perform image processing task 2 on each frame of image data received according to shooting strategy 2 to obtain the corresponding shooting picture of each frame.
  • Camera HAL can report each frame of the shot after image processing to the camera application through the Camera Service.
  • the camera application can display each frame of the shot in the display interface, or the camera application can display the shot in the form of a photo or video.
  • Each frame of the shot is saved in the mobile phone.
  • This embodiment of the present application will introduce the working principle of each software layer in the mobile phone to implement the method of the embodiment of the present application with reference to FIG. 5B.
  • Camera HAL can call the camera driver in the kernel layer based on the previously received recording instruction, and the camera driver drives the camera and other hardware devices to collect image data in response to the recording instruction.
  • the camera can pass each frame of image data collected to the Camera HAL through the camera driver according to a certain frame rate.
  • the data stream composed of each frame of image passed by the camera driver to the Camera HAL based on the recording instruction can be the video stream (such as a preview stream and a recording stream) described in the embodiments of this application.
  • the Camera Service can determine that the shooting strategy 3 at this time is to capture the image in the video according to the received capture command.
  • the shooting strategy sets a specific image processing task 3 that needs to be performed on the collected image data.
  • the image processing task 3 includes cropping processing according to the cropping method and cropping parameters of the target preview image.
  • the image processing task 3 is used to achieve Capture function during video recording.
  • the Camera Service can send the determined shooting strategy 3 to the Camera HAL.
  • Camera HAL When Camera HAL receives each frame of image data collected by the camera, it can perform corresponding image processing tasks 3 on the above image data according to the shooting strategy 3 issued by the Camera Service, such as cropping the target preview image according to the cropping method and cropping parameters.
  • image processing tasks 3 such as cropping the target preview image according to the cropping method and cropping parameters.
  • the m frames of first images including the second image are cropped to obtain corresponding snapshot images.
  • each frame of image (ie, the first image) exposed and output by the image sensor (Sensor) of the camera can be buffered in the first buffer queue (Buffer).
  • the first cache queue (Buffer) can be set at any layer of the mobile phone software system.
  • the first cache queue (Buffer) can be set in the memory area accessed by the Camera HAL through the software interface.
  • Camera HAL can select a capture frame from the Buffer based on the metadata of the multi-frame Bayer image (i.e. the first image) cached in the Buffer. In this way, the mobile phone can obtain the captured frame with higher image quality from the first cache queue.
  • the additional information of the first image may include the contrast of the first image and the angular velocity when the camera collects the first image. It should be understood that the smaller the angular velocity is, the smaller the jitter is when the camera collects the first image; the larger the angular velocity is, the greater the jitter is when the camera collects the first image. Contrast is used to characterize the sharpness of the first image. The higher the contrast, the clearer the first image. In this way, according to the additional information of the first image, the Bayer image with small jitter and maximum image clarity can be selected from the multiple frames of Bayer images (ie, the first image) cached in the Buffer as the capture frame.
  • the additional information of each frame of Bayer image cached in the Buffer can be obtained by the ISP of the camera at the hardware layer assigning metadata (metadata) to each frame of Bayer image in the Buffer.
  • ISP can be divided into statistical modules and processing modules according to functions.
  • the statistics module can include (image front end, IFE), and the processing module can include (image processing engine, IPE) and (bayer processing segment, BPS).
  • the above additional information of the Bayer image can be obtained by assigning the metadata of each frame of the Bayer image in the Buffer by the statistics module of the ISP.
  • the processing module of ISP is used to process the image output by the Sensor exposure.
  • the above angular velocity can be collected by a gyroscope in the electronic device.
  • the software code for scheduling the gyroscope is stored in the HAL.
  • Camera HAL can call the gyroscope driver in the kernel layer, and the gyroscope driver drives the gyroscope to collect the angular velocity of the electronic device.
  • the angular velocity of the electronic device is the angular velocity of the camera.
  • the angular velocity of the camera may be different at different times, and the camera's Sensor can expose and output different Bayer images at different times.
  • Camera HAL can also call the camera driver in the kernel layer, and the camera driver drives the statistics module of the ISP in the camera to write the angular velocity collected by the gyroscope into the metadata of the Bayer image output by the Sensor.
  • the additional information of the Bayer image also includes the time when the Sensor exposes and outputs the Bayer image.
  • the statistics module of the ISP can determine the angular velocity of the Bayer image based on the acquisition time of the angular velocity and the exposure time of the Bayer image, and write the angular velocity of the Bayer image into the metadata of the Bayer image.
  • the statistics module of the ISP can also analyze the Bayer image, obtain the contrast of the Bayer image, and write the contrast of the Bayer image into the metadata of the Bayer image.
  • the Sensor exposure end time can be used as a timestamp; on others
  • the platform can use the Sensor's start exposure time as a timestamp, and the embodiments of this application do not limit this.
  • the above-mentioned exposure end time and exposure start time are collectively referred to as exposure time.
  • the statistics module of the ISP can write the angular velocity and contrast of each Bayer image in the Buffer into the metadata of the corresponding Bayer image through a first preset interface, such as the first camera serial interface (CSI).
  • a first preset interface such as the first camera serial interface (CSI).
  • the above-mentioned first preset CSI may be a software interface between the Sensor and the Buffer.
  • Camera HAL can also perform first image processing (including cropping processing) on the m-frame first image including the second image through the second preset interface (such as the second preset CSI) to obtain the captured image.
  • the above-mentioned second preset CSI may be a software interface between the Buffer and the module that performs the first image processing. Then, Camera HAL can report the captured image to the camera application through Camera Service, and the camera application can save the captured image in the form of a photo on the phone.
  • Camera HAL can also include preset RAW domain image processing algorithms.
  • Camera HAL can call the preset RAW domain image processing algorithm through the second preset interface (such as the second preset CSI) to process the captured frame and the adjacent frames of the captured frame to obtain the processed image frame.
  • the above-mentioned second preset CSI may be a software interface between the Buffer and the preset RAW domain image processing algorithm.
  • Camera HAL can call the encoder (ENCODE) to encode the image frame, and then a frame of captured image can be obtained.
  • Camera HAL can report captured images to the camera application through Camera Service, and the camera application can save the captured images in the form of photos on the phone.
  • the Camera HAL also includes a perception module.
  • the sensing module can determine whether the mobile phone is in a high dynamic range (HDR) scene based on the captured frame and adjacent frames of the captured frame.
  • HDR high dynamic range
  • the preset RAW domain image processing algorithm performs different image processing processes in HDR scenes and non-HDR scenes.
  • image processing process of the preset RAW domain image processing algorithm in HDR scenes and non-HDR scenes please refer to the detailed introduction in the following embodiments, which will not be described again here.
  • the embodiment of the present application provides a method for capturing images during video recording, which method can be applied to a mobile phone including a camera. As shown in Figure 6, the method may include S601-S605.
  • the mobile phone receives the user's first operation.
  • the first operation is used to trigger the mobile phone to start recording video.
  • the mobile phone may display the video viewing viewfinder interface 701 shown in FIG. 7 .
  • the viewfinder interface 701 of the video recording is the viewfinder interface of the mobile phone that has not yet started recording.
  • the recording viewfinder interface 701 includes a "Start Recording" button 702 .
  • the above-mentioned first operation may be the user's click operation on the "Start Recording" button 702, which is used to trigger the mobile phone to start recording video.
  • the camera of the mobile phone collects the first image, and the mobile phone displays the first interface.
  • the first interface is a viewfinder interface where the mobile phone is recording a video.
  • the viewfinder interface displays a preview stream, and the preview stream includes a preview image obtained from the first image.
  • the first interface also includes a capture shutter, which is used to trigger the mobile phone to capture images to obtain photos.
  • the mobile phone's camera can start to collect images (ie, the first image), and the mobile phone's display screen can display the first interface 703 shown in Figure 7.
  • the first interface 703 is a viewfinder interface where the mobile phone is recording video.
  • the first interface 703 includes a preview image 704 obtained from the above-mentioned first image.
  • the multi-frame preview image 704 may constitute the one shown in Figure 1, Figure 4A, Figure 4C or Figure 4D. Preview stream.
  • the embodiment of the present application introduces a method for the mobile phone to obtain the preview image 704 from the first image.
  • the mobile phone can process the first image according to the processing method of the preview stream shown in Figure 1, Figure 4A, Figure 4C or Figure 4D to obtain a preview. Image 704.
  • the ISP of the mobile phone can use the ISP to process the first image of each frame collected by the camera.
  • the image sensor (Sensor) of the mobile phone is controlled by exposure and can continuously output Bayer images.
  • the ISP of the mobile phone After each frame of Bayer image is processed by the ISP of the mobile phone, it is sent to the encoder 1 (ENCODER) for encoding, and a preview image 704 can be obtained.
  • the processed multi-frame preview image 704 may form a preview video stream (ie, preview stream).
  • the first interface 703 also includes a snapshot shutter 702 .
  • the capture shutter 702 is used to trigger the mobile phone to capture images to obtain photos.
  • the capture shutter 702 is used to trigger the mobile phone to capture images and obtain photos during the video recording process. It is conceivable that during the process of recording video (that is, video recording) on a mobile phone, some wonderful pictures may be collected. During the video recording process on the mobile phone, the user may hope that the mobile phone can capture the above wonderful scenes and save them as photos to show to the user. In the embodiment of the present application, the user can click the above-mentioned snapshot shutter 702 to realize the function of capturing wonderful pictures during the video recording process.
  • the mobile phone can cache the Sensor exposure output Bayer image in a first buffer queue (Buffer).
  • a delay time as shown in Figure 3 (such as 120ms-160ms); within this delay time, the Sensor's frames can be cached in the Buffer. Therefore, when the mobile phone receives the user's capture operation, the Bayer image output by the Sensor can also be cached in the first cache queue.
  • the mobile phone can select a frame with better image quality from the Buffer as a snapshot image.
  • the mobile phone may also perform S603.
  • the mobile phone caches the first image collected by the camera in the first cache queue.
  • the first cache queue caches n frames of first images collected by the camera, n ⁇ 1, and n is an integer.
  • the mobile phone may buffer the first image collected by the camera in the first buffer queue (Buffer) shown in FIG. 8 .
  • the first buffer queue can buffer n frames of first images collected by the camera on a first-in, first-out basis.
  • the tail of the first cache queue can perform an enqueue operation for inserting the first image; the head of the first cache queue can perform a dequeuing operation for deleting the first image.
  • the head of the first cache queue deletes one frame of the first image.
  • n may equal 1.
  • one frame of the first image can be cached in the first cache queue.
  • the electronic device can only perform first image processing on one frame of the first image (ie, the second image) to obtain a captured image.
  • n may be greater than 1.
  • multiple frames of the first image can be cached in the first cache queue.
  • the electronic device can perform image processing on one frame of the first image to obtain a snapshot image, or can perform image processing on multiple frames of the first image to obtain a snapshot image.
  • electronic The device performs image processing on the first image of multiple frames to obtain a captured image, which can enhance the image quality of the captured frame (i.e., the reference frame), which is beneficial to obtaining information such as noise and texture, and can further improve the image quality of the output image.
  • the mobile phone selects a second image from n frames of first images cached in the first cache queue according to the additional information of the first image.
  • the additional information of the first image includes at least one of the contrast of the first image, the angular velocity when the camera collects the first image, and the timestamp of the first image.
  • the above-mentioned second operation may be the user's click operation on the snap shutter.
  • the second operation may be a user's click operation on the snapshot shutter shown in FIG. 7 .
  • the second operation may be the user's continuous clicking operation on the snap shutter.
  • each click operation of the snap shutter is used to trigger the mobile phone to perform the following operation: "Select the second image from n frames of the first image cached in the first cache queue according to the additional information of the first image.” and S605.
  • a click on the capture shutter is used to trigger the phone to capture a photo.
  • Continuous clicks on the capture shutter are used to trigger the phone to capture multiple photos.
  • the method of capturing multiple photos with the mobile phone during video recording is similar to the method of capturing one photo, and will not be described in detail here.
  • the Camera HAL in the HAL of the mobile phone may include a frame selection module.
  • the frame selection module can select the second image (i.e. the capture frame) from the n frames of first images buffered in the first cache queue (Buffer) based on the additional information of the first image. , also called a reference frame).
  • the additional information of the first image includes at least one of the contrast of the first image, the angular velocity when the camera collects the first image (referred to as the angular velocity of the first image), and the timestamp of the first image.
  • contrast can also be called gradient.
  • gradient can also be called sharpness.
  • sharpness The greater the sharpness of an image, the clearer the image. That is, contrast can be used to characterize the clarity of the first image. The higher the contrast of an image (such as the first image), the clearer the image.
  • the angular velocity can be collected by a gyroscope sensor.
  • the value of the angular velocity of a frame of image i.e., the first image
  • the value of the angular velocity of a frame of image can represent the magnitude of the angular velocity when the camera (such as a sensor of the camera) collects the image.
  • the smaller the angular velocity the smaller the jitter when the camera collects the first image; the larger the angular velocity, the greater the jitter when the camera collects the first image.
  • the mobile phone (such as the frame selection module in the HAL of the mobile phone) can select a frame with small jitter from the multi-frame Bayer images (i.e. the first image) buffered in the first cache queue (Buffer) based on the additional information of the first image.
  • the Bayer image with the largest image definition is used as the captured frame (i.e., the second image).
  • a mobile phone (such as the frame selection module in the HAL of the mobile phone) can traverse the n frames of the first image cached in the Buffer, and select a frame with small jitter from the n frames of the first image cached in the Buffer based on the additional information of the first image.
  • the first image with the largest image definition is used as the second image.
  • the method for the mobile phone to select the second image from n frames of first images cached in the first buffer queue (Buffer) according to the additional information of the first image may include Sa.
  • Sa The mobile phone selects the first image with the largest contrast from the n frames of first images cached in the first cache queue (Buffer).
  • the contrast of the first image of one frame is the largest, which is greater than the contrast of other first images other than the first image of this frame.
  • the mobile phone can use the first image of the frame with the highest contrast as the second image.
  • the n frames of first images cached in the first buffer queue may include at least two frames of first images with the same contrast. Moreover, the contrast of the at least two frames of first images is greater than the contrast of other first images in the n frames of first images.
  • the phone can also perform Sb. Sb: The mobile phone selects the first image with the smallest angular velocity as the second image from the at least two frames of first images.
  • the metadata of the Bayer image (ie, the first image) exposed by the Sensor does not include the above additional information.
  • the additional information of the first image may be obtained by assigning the metadata of each frame of Bayer image in the first cache queue (Buffer) by the statistics module of the ISP.
  • the frame selection module in the mobile phone can select the second image from the n frames of first images cached in the first cache queue according to the additional information of the first image.
  • each frame of the first image includes a timestamp, and the timestamp records the time when the image sensor Sensor outputs the corresponding first image (ie, the exposure time). This timestamp may also be included in the metadata of the first image.
  • the clock of the upper-layer application of the mobile phone is synchronized with the clock of the Sensor that records the first image and outputs the picture; or, the clock of the upper-layer application of the mobile phone and the clock of the Sensor that records the first image and outputs the picture are the same system clock.
  • the time when the mobile phone receives the second operation ie, the capture operation
  • the mobile phone can select the acquisition time and the time of the first image based on the time when the gyroscope sensor collects each angular velocity.
  • the angular velocity with the latest recorded exposure time is used as the angular velocity of the first image.
  • the statistics module of the ISP can write the angular velocity of the first image into the metadata of the first image.
  • the Sensor exposure end time can be used as a timestamp; on other platforms, the Sensor exposure start time can be used as a timestamp, and the embodiments of this application do not limit this.
  • the above-mentioned exposure end time and exposure start time are collectively referred to as exposure time.
  • the mobile phone (such as the frame selection module in the HAL of the mobile phone) can, based on the timestamp of the first image, From n frames of first images cached in the first buffer queue (Buffer), a first frame of the first image whose time indicated by the timestamp is closest to the time when the user triggers the capture is selected as the capture frame (ie, the second image).
  • the mobile phone (such as the frame selection module in the HAL of the mobile phone) performs "selecting the second image from n frames of the first image buffered in the first buffer queue (Buffer)", the first The image is abnormally judged, the abnormal frame in the Buffer (that is, the abnormal first image) is discarded, and the second image is selected from the normal first image in the Buffer.
  • the mobile phone (such as the frame selection module in the HAL of the mobile phone) can compare the exposure time of the first image of the frame (denoted as image frame a), and the first image of the previous frame of the image frame a (denoted as image frame a).
  • the exposure time of b) is used to determine whether image frame a is abnormal.
  • the exposure time of each frame of Bayer image output by the Sensor will generally not change significantly. For example, the exposure time of adjacent image frames will not suddenly become very high, or the exposure time of adjacent image frames will not suddenly change. Very low. For example, the difference in exposure time of adjacent image frames generally does not exceed 10 milliseconds (ms), and the maximum difference does not exceed 20 ms.
  • the preset exposure threshold can be less than 20ms and takes a value around 10ms.
  • the preset exposure threshold can be 10ms, 9ms, 11ms or 8ms etc.
  • the method for the mobile phone (such as the frame selection module in the HAL of the mobile phone) to select the second image from the normal first image in the Buffer can refer to the method described in the above embodiments, which will not be described in detail here. .
  • the mobile phone performs first image processing on m frames of first images including the second image to obtain a captured image.
  • the first image processing includes reference processing according to the cropping method and cropping parameters of the target preview image.
  • the target preview image is a frame of image collected by the camera when the mobile phone receives the second operation in the above preview stream, m ⁇ 1, m is integer.
  • m may equal 1. That is to say, the m-frame first image is the above-mentioned second image. That is, the mobile phone performs the first image processing on the above-mentioned second image to obtain a capture image with higher image quality.
  • the integrity of data and texture and other parameters in a frame of image are limited. The mobile phone only performs the first image processing on one frame of image, which may not effectively improve the image quality of this frame of image.
  • m may be greater than 1.
  • the mobile phone can perform first image processing on the second image and the first image of m-1 frames. That is, the first image processing can be performed on m frames of first images including the second image among n frames of first images.
  • the second image in the m-frame first image i.e., m-1 frame first image
  • can enhance the image quality of the captured frame i.e., the reference frame
  • texture information which can further improve the quality of captured images.
  • the m frames of first images are m consecutive m frames of images including the second image among n frames of first images. That is, the m frames of the first image are the adjacent m frames of the first image including the second image.
  • the image content of adjacent images is more similar, so using m consecutive m-frame images to obtain the captured image is more conducive to improving the image quality of the captured image.
  • the m frames of first images include the second image, and m-1 frames of first images among n frames of first images whose resolution is greater than a preset resolution threshold.
  • the resolution of the first image in a certain frame is greater than the preset resolution threshold, it means that the resolution of the first image is larger, which can also be understood as the clarity of the first image is higher.
  • the mobile phone can
  • the first image is determined to be one frame of image included in the above-mentioned m frames of first images (specifically, m-1 frames of first images). That is, the above m-1 first image frame is the first image with a larger resolution among the n first image frames. Therefore, using the first image with a larger resolution to obtain the captured image is more conducive to improving the image quality of the captured image.
  • the m-frame first image includes the second image, and the m-1 first frame in the n-frame first image whose high dynamic range (high dynamic range, HDR) parameter satisfies the preset HDR condition. image.
  • high dynamic range high dynamic range
  • HDR high dynamic range
  • the mobile phone can determine that the first image is an image included in the above-mentioned m frames of first images (specifically, m-1 frames of first images). That is, the above m-1 first image frame is the first image with higher image quality among the n first image frames. Therefore, using the first image with higher image quality to obtain the captured image is more conducive to improving the image quality of the captured image.
  • the first image processing described in the embodiments of the present application is an image processing process of multi-frame input and single-frame output.
  • the electronic device performs first image processing on m frames of first images including the second image, which can improve the image quality of the captured image, and can improve the image quality of the captured image.
  • first image processing is implemented by the electronic device based on software.
  • Implementing the first image processing process on the m-frame first image including the second image in a software-based manner can improve the image processing efficiency and save the power consumption of the mobile phone.
  • the cropping method of the target preview image includes a center cropping method
  • the cropping parameters of the target preview image include the coordinates of the center point of the cropping area of the target preview image and the cropped size information.
  • a frame of the preview image and a frame of the first image may be full size images, and the mobile phone can crop the preview image and the first image according to the corresponding cropping method and cropping parameters.
  • the above-mentioned cropped size information may be the size information of the cropped area, or may be the size information of the remaining area, and the remaining area is the image obtained after cropping the first image.
  • the cropped size information includes width and height information.
  • the mobile phone can use ISP to process the first image to obtain a preview image through time division multiplexing, use ISP (such as the statistics module of ISP) to assign a value to the metadata of the first image, and ISP First image processing is performed on m frames of first images including second images.
  • ISP such as the statistics module of the ISP
  • the ISP performs the first image processing on the m-frame first image including the second image. This will not affect the mobile phone's use of the ISP to process the first image.
  • Get preview image In other words, the mobile phone's processing of the capture stream shown in Figure 4C or Figure 4D will not affect the mobile phone's processing of video streams (such as preview streams and recording streams).
  • the preset RAW domain image processing algorithm described in the embodiment of this application is a neural network model with multi-frame input and single-frame output.
  • the preset RAW domain image processing algorithm is a RAW domain image quality enhanced deep learning network.
  • the use of preset RAW domain image processing algorithms can improve the quality of captured images and help improve the image quality of captured images.
  • the mobile phone in response to the user's click operation on the snapshot shutter shown in Figure 7 (ie, the second operation), the mobile phone can generate and save the snapshot photo. However, while the mobile phone is recording, the user cannot view the captured photo. Users can view the captured photos in the photo album after the recording ends.
  • the mobile phone in response to the user's click operation on the "end recording" button 706 shown in FIG. 9 , the mobile phone may display the video framing interface 901 shown in FIG. 9 .
  • the video viewing viewfinder interface 901 is the viewfinder interface where the mobile phone has not started recording. Compared with the viewfinder interface 701 of the video recording shown in Figure 7, the photos in the photo option in the viewfinder interface of the mobile phone are updated from 708 shown in Figure 7 to 902 shown in Figure 9.
  • the mobile phone may display the photo album list interface 1001 shown in FIG. 10 in response to the user's startup operation of the photo album application.
  • the photo album list interface 1001 includes multiple photos and videos saved in the mobile phone.
  • the album list interface 1001 includes a video 1003 recorded by the mobile phone, and a photo 1002 captured by the mobile phone during the recording of the video 1003.
  • the mobile phone can cache the Sensor exposure output Bayer image in a first buffer queue (Buffer).
  • the first cache queue can cache multiple frames of Bayer images.
  • the Sensor's frames can be cached in the Buffer. Therefore, when the mobile phone receives the user's capture operation, the Bayer image output by the Sensor can also be cached in the first cache queue.
  • the image content of the frame output by the Sensor will not change much in a short period of time; therefore, the mobile phone can select a frame with better image quality from the Buffer as a snapshot image.
  • the mobile phone can also perform first image processing on m frames of first images including the second image to obtain a captured image. Since the first image processing includes cropping processing according to the cropping method and cropping parameters of the target preview image; therefore, the mobile phone can crop the m frames of the first image including the second image according to the cropping method and cropping parameters of the target preview image, and can Obtaining a captured image with the same FOV as the target preview image can improve the image quality of the captured image.
  • images that meet user needs can be captured during the video recording process, and the image quality of the captured images can be improved.
  • the mobile phone performs first image processing on m frames of first images including the second image to obtain a captured image (ie, S605), which may specifically include S1101-S1103.
  • the mobile phone performs image fusion on the m-frame first image to obtain a third image.
  • the mobile phone can perform image fusion on multiple frames of the first image based on the fusion algorithm.
  • image fusion of the m-frame first image can enhance the image quality of the captured frame (i.e., the reference frame), which is beneficial to obtaining information such as noise and texture, and can further improve the image quality of the captured image.
  • the mobile phone performs cropping processing on the third image according to the cropping method and cropping parameters of the target preview image to obtain a fourth image.
  • image fusion of the m-frame first image of the mobile phone to obtain the third image is a process of generating one frame of image based on multiple frames of images.
  • the mobile phone only needs to crop one frame of image (ie, the third image), which can improve the image processing efficiency and save the power consumption of the mobile phone.
  • the mobile phone performs second image processing on the fourth image to obtain a captured image.
  • the second image processing includes: at least one of image noise reduction, brightness and acceptance correction, and image beautification processing.
  • picture noise reduction is used to reduce noise in images, which can improve the clarity of images and improve the image quality of captured images.
  • Brightness and acceptance correction are used to calibrate the brightness and color of images, and can also improve the image quality of captured images.
  • Mobile phones can beautify images based on skin beautification algorithms, which can improve the aesthetics of images and improve the display effect of captured images.
  • the mobile phone can perform image fusion on m frames of first images to obtain a third image, and perform cropping processing on the third image according to the cropping method and cropping parameters of the target preview image to obtain a fourth image. That is, the electronic device can improve the image processing efficiency and save the power consumption of the mobile phone by fusing multiple frames of the first image into one frame of image (ie, the third image) and only need to crop the one frame of image.
  • the mobile phone can also perform image noise reduction, brightness and acceptance correction, and image beautification processing on the fourth image, which can improve the image quality and display effect of the captured image.
  • each frame of Bayer image output by the image sensor (Sensor) of the mobile phone passes through the mobile phone's image front-end engine, the first cache queue, the frame selection module, the multi-frame fusion module, the Bayer processing module, and the image processing module.
  • Engine and stylization processing module you can obtain the above captured images.
  • the first cache queue is used to cache the first image collected by the camera.
  • the frame selection module is used to select a second image from n frames of first images buffered in the first buffer queue.
  • the multi-frame fusion module is used to perform image fusion on m frames of the first image to obtain a third image.
  • the Bayer processing module is used to crop the third image according to the cropping method and cropping parameters of the target preview image to obtain the fourth image; the Bayer processing module is also used to perform brightness and acceptance correction processing on the fourth image.
  • the stylization processing module is used to perform color processing, picture beautification processing, high dynamic processing, etc. on the fourth image.
  • the Bayer processing module can obtain the cropping method and cropping parameters of the target preview image from the anti-shake module.
  • the mobile phone obtains the logical identifier of the target preview image.
  • the logical identifier of the target preview image is used to identify the camera that collected the target preview image.
  • the mobile phone may include N cameras, where N is a positive integer greater than 1.
  • the mobile phone can capture the first image through multiple cameras.
  • the mobile phone determines m frames of first images including the second image from n frames of first images based on the logical identification of the target preview image.
  • the logical identifier of the first image of the m frame is the same as the logical identifier of the target preview image.
  • one logical identifier corresponds to one camera.
  • the logical identifier of the first image of the m frame is the same as the logical identifier of the target preview image, indicating that the camera that collects the first image of the m frame and the camera that collects the target preview image are the same camera.
  • the mobile phone can determine a first frame of n frames of first images collected by the same camera as the camera that collects the target preview image as a frame of m frames of first images. It is possible to conveniently and quickly determine m frames of first images from n frames of first images, thereby improving the efficiency of generating captured images.
  • one logical identifier can correspond to multiple cameras, or it can also be understood that one logical identifier corresponds to a camera set, and the camera set includes multiple cameras.
  • the logical identifier of the first image of the m frame is the same as the logical identifier of the target preview image, indicating that the camera that collects the first image of the m frame and the camera that collects the target preview image belong to the same camera set.
  • the mobile phone can determine a frame of first images collected by each of the multiple cameras included in the same camera set as one frame of m frames of first images. A large number of first images to be subjected to first image processing can be determined, thereby improving the image quality of the captured images.
  • the mobile phone can also collect the current temperature of the mobile phone. If the current temperature of the mobile phone is greater than the preset temperature threshold, then the m-frame first image only includes the second image; if the current temperature of the mobile phone is less than or equal to the preset temperature threshold, then m ⁇ 2.
  • the mobile phone can enable the thermal escape mechanism. Specifically, only one frame of image (i.e., the second image) is used when performing the first image processing, which can reduce the operating load of the mobile phone and save the power consumption of the mobile phone.
  • the mobile phone can obtain the captured image based on the first image of multiple frames, which can improve the image quality of the captured image.
  • an electronic device which may include: the above-mentioned display screen, camera, memory, and one or more processors.
  • the display, camera, memory and processor are coupled.
  • the memory is used to store computer program code, which includes computer instructions.
  • the processor executes the computer instructions, the electronic device can perform each function or step performed by the mobile phone in the above method embodiment.
  • the structure of the electronic device may refer to the structure of the electronic device shown in FIG. 5A.
  • the chip system 1400 includes at least one processor 1401 and at least one interface circuit 1402.
  • the processor 1401 and the interface circuit 1402 may be interconnected by wires.
  • interface circuitry 1402 may be used to receive signals from other devices, such as memory of an electronic device.
  • interface circuit 1402 may be used to send signals to other devices (eg, processor 1401).
  • the interface circuit 1402 can read instructions stored in the memory and send the instructions to the processor 1401.
  • the electronic device can be caused to perform various steps in the above embodiments.
  • the chip system may also include other discrete devices, which are not specifically limited in the embodiments of this application.
  • Embodiments of the present application also provide a computer storage medium.
  • the computer storage medium includes computer instructions.
  • the electronic device When the computer instructions are run on the above-mentioned electronic device, the electronic device causes the electronic device to perform various functions or steps performed by the mobile phone in the above-mentioned method embodiments. .
  • Embodiments of the present application also provide a computer program product.
  • the computer program product When the computer program product is run on a computer, it causes the computer to perform various functions or steps performed by the mobile phone in the above method embodiments.
  • the disclosed devices and methods can be implemented in other ways.
  • the device embodiments described above are only illustrative.
  • the division of modules or units is only a logical function division.
  • there may be other division methods for example, multiple units or components may be The combination can either be integrated into another device, or some features can be omitted, or not implemented.
  • the coupling or direct coupling or communication connection between each other shown or discussed may be through some interfaces, and the indirect coupling or communication connection of the devices or units may be in electrical, mechanical or other forms.
  • the units described as separate components may or may not be physically separated.
  • the components shown as units may be one physical unit or multiple physical units, that is, they may be located in one place, or they may be distributed to multiple different places. . Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of this embodiment.
  • each functional unit in each embodiment of the present application can be integrated into one processing unit, each unit can exist physically alone, or two or more units can be integrated into one unit.
  • the above integrated units can be implemented in the form of hardware or software functional units.
  • the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a readable storage medium.
  • the technical solutions of the embodiments of the present application are essentially or contribute to the existing technology, or all or part of the technical solution can be embodied in the form of a software product, and the software product is stored in a storage medium , including several instructions to cause a device (which can be a microcontroller, a chip, etc.) or a processor to execute all or part of the steps of the methods described in various embodiments of this application.
  • the aforementioned storage media include: U disk, mobile hard disk, read only memory (ROM), random access memory (RAM), magnetic disk or optical disk and other media that can store program code.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Human Computer Interaction (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Studio Devices (AREA)

Abstract

本申请公开了一种录像中抓拍图像的方法及电子设备,涉及拍摄技术领域,可在录像中抓拍图像,提升抓拍图像的图像质量。电子设备响应于第一操作,采集第一图像,显示第一界面;第一界面是电子设备正在录制视频的取景界面;在第一缓存队列缓存摄像头采集的n帧第一图像;响应于用户对抓拍快门的第二操作,根据第一图像的附加信息,从第一缓存队列缓存的n帧第一图像中选择出第二图像;对包括第二图像的m帧第一图像进行第一图像处理,得到抓拍图像;其中,该第一图像处理包括按照目标预览图像的裁剪方式和裁剪参数的裁剪处理;目标预览图像是预览流中、电子设备接收到第二操作时摄像头采集的一帧图像。

Description

一种录像中抓拍图像的方法及电子设备
本申请要求于2022年09月14日提交国家知识产权局、申请号为202211116174.5、发明名称为“一种录像中抓拍图像的方法及电子设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本申请涉及拍摄技术领域,尤其涉及一种录像中抓拍图像的方法及电子设备。
背景技术
现有的手机一般具有拍照和录像功能,越来越多的人使用手机拍摄照片和视频来记录生活的点点滴滴。其中,手机录制视频(即录像)的过程中,可能会采集到的一些精彩的画面。在手机录像的过程中,用户可能会希望手机可以抓拍到上述精彩的画面,并保存成照片展示给用户。因此,亟待一种可以实现在录像过程中抓拍图像的方案。
在一些方案中,手机可以截取视频流(如预览流或录像流)中用户抓拍瞬间采集的一帧图像,作为抓拍图像保存成照片展示给用户。但是,手机录像过程中,每秒需要处理大量图像(如30帧图像)。如此,留给每一帧图像的运算资源和时间都是有限的;因此,手机一般可以使用ISP的硬件处理模块,采用较为简单的处理方式来处理视频流;而不会使用复杂的算法来提升画质。这样的图像处理效果,只能满足视频的要求;而拍照对画质的要求则更高。因此,截取视频流中的图像,并不能抓拍到用户满意的图像。
发明内容
本申请提供一种录像中抓拍图像的方法及电子设备,可以在录像过程中抓拍图像,并且可以提升抓拍图像的图像质量。
本申请实施例的技术方案如下:
第一方面,提供一种录像中抓拍图像的方法,该方法可以应用于电子设备。该方法中,电子设备可以接收用户的第一操作。该第一操作用于触发电子设备开始录制视频。响应于第一操作,电子设备的摄像头采集第一图像,电子设备显示第一界面。其中,第一界面是电子设备正在录制视频的取景界面,取景界面显示预览流,预览流包括由第一图像得到的预览图像。第一界面还包括抓拍快门。抓拍快门用于触发电子设备抓拍图像得到照片。电子设备可以在第一缓存队列缓存摄像头采集的第一图像。其中,第一缓存队列缓存摄像头采集的n帧第一图像,n≥1,n为整数。
之后,电子设备响应于用户对抓拍快门的第二操作,可以根据第一图像的附加信息,从第一缓存队列缓存的n帧第一图像中选择出第二图像。该第一图像的附加信息包括第一图像的对比度、摄像头采集第一图像时的角速度和第一图像的时间戳中的至少一个。最后,电子设备可以对包括第二图像的m帧第一图像进行第一图像处理,得到抓拍图像。其中,第一图像处理包括按照目标预览图像的裁剪方式和裁剪参数的简 餐处理。目标预览图像是预览流中、电子设备接收到第二操作时摄像头采集的一帧图像。m≥1,m为整数。上述第一图像处理具备提升图像画质的功能。
一方面,本申请实施例中,电子设备(如手机)可以将图像传感器(Sensor)曝光输出Bayer图像缓存在一个第一缓存队列(Buffer)中。该Buffer可以缓存Bayer图像。如此,即使从接收到用户的抓拍操作到Snapshot程序接收到抓拍指令,存在延迟时长(如120ms-160ms);在这段延迟时长内Sensor出帧都可以缓存在Buffer中。因此,电子设备接收到用户的抓拍操作时,Sensor输出的Bayer图像也可以缓存在第一缓存队列中。并且,短时间内Sensor出帧的图像内容不会发生太大变化。如此,可以由电子设备的选帧模块根据Buffer中缓存的图像的附加信息,从Buffer中选择出图像质量较好的一帧图像作为抓拍图像。这样,可以提升抓拍图像的图像质量。
另一方面,电子设备还可以对包括第二图像的m帧第一图像进行第一图像处理得抓拍图像。由于该第一图像处理包括按照目标预览图像的裁剪方式和裁剪参数的裁剪处理,因此电子设备可以按照目标预览图像的裁剪方式和裁剪参数对包括第二图像的m帧第一图像进行裁剪处理,能够得到与目标预览图像的视场角(field of view,FOV)相同的抓拍图像,可以提升抓拍图像的画质。
综上所述,采用本申请的方法,可以在录像过程中抓拍到满足用户需求的图像,并且可以提升抓拍图像的图像质量。
在第一方面的一种可能的设计方式中,上述目标预览图像的裁剪方式包括中心裁剪方式。目标预览图像的裁剪参数包括目标预览图像的裁剪区域的中心点坐标和裁剪的尺寸信息。应理解,电子设备可以按照中心裁剪方式对包括第二图像的m帧第一图像进行裁剪处理。并且在对包括第二图像的m帧第一图像进行裁剪处理的过程中,可以按照目标预览图像的裁剪区域的中心点坐标和裁剪的尺寸信息,裁剪出与目标预览图像的大小相同并且FOV相同的抓拍图像。这样,能够提升抓拍图像的图像质量。
在第一方面的另一种可能的设计方式中,m≥2。电子设备对包括第二图像的m帧第一图像进行第一图像处理,得到抓拍图像,具体可以包括:电子设备对m帧第一图像进行图像融合,得到第三图像。电子设备按照目标预览图像的裁剪方式和裁剪参数,对第三图像进行裁剪处理,得到第四图像。电子设备对第四图像进行第二图像处理,得到抓拍图像。其中,第二图像处理包括图片降噪、亮度及验收校正和图片美化处理中的至少一种。应理解,电子设备通过将多帧第一图像融合为一帧图像(即第三图像),并且只需要对一帧图像进行裁剪处理,可以提升图像的处理效率,节省电子设备的功耗。另外,图片降噪和亮度及验收校正可以提升抓拍图像的图像质量,图片美化处理可以提高抓拍图像的显示效果,因此电子设备对第四图像进行第二图像处理,能够提升抓拍图像的图像质量及显示效果。
在第一方面的另一种可能的设计方式中,该方法还包括:电子设备获取目标预览图像的逻辑标识。目标预览图像的逻辑标识用于标识采集目标预览图像的摄像头。电子设备根据目标预览图像的逻辑标识,从n帧第一图像中确定出包括第二图像的m帧第一图像。其中,m帧第一图像的逻辑标识与目标预览图像的逻辑标识相同。在一种情况下,一个逻辑标识对应一个摄像头。该m帧第一图像的逻辑标识与该目标预览图像的逻辑标识相同,说明采集该m帧第一图像的摄像头与采集该目标预览图像的摄像 头为相同的摄像头。此时手机可以将n帧第一图像中,与采集目标预览图像的摄像头相同的摄像头采集的一帧第一图像,确定为m帧第一图像中的一帧图像。能够方便、快捷地从n帧第一图像中确定出m帧第一图像,进而可以提升抓拍图像的生成效率。在另一种情况下,一个逻辑标识可以对应多个摄像头,也可以理解为一个逻辑标识对应一个摄像头集合,该摄像头集合中包括多个摄像头。该m帧第一图像的逻辑标识与该目标预览图像的逻辑标识相同,说明采集该m帧第一图像的摄像头与采集该目标预览图像的摄像头属于同一个摄像头集合。此时手机可以将该同一个摄像头集合中包括的多个摄像头中每一个摄像头采集的一帧第一图像,确定为m帧第一图像中的一帧图像。可以确定出大量的、待进行第一图像处理的第一图像,进而可以提升抓拍图像的图像质量。
在第一方面的另一种可能的设计方式中,m帧第一图像为n帧第一图像中包括第二图像的连续m帧图像。或者,m帧第一图像包括第二图像,以及n帧第一图像中、分辨率大于预设分辨率阈值的m-1帧第一图像。或者,m帧第一图像包括第二图像,以及n帧第一图像中、高动态范围参数满足预设HDR条件的m-1帧图像。具体的,包括第二图像的连续的m帧图像为包括第二图像在内的相邻的m帧第一图像。相邻图像的图像内容的相似度更高,因此采用连续的m帧图像得到抓拍图像,更有利于提升抓拍图像的图像质量。m-1帧第一图像的分辨率大于预设分辨率阈值,说明m-1帧第一图像的分辨率较大,也可以理解为m-1帧第一图像的清晰度较高。因此,采用分辨率较大的第一图像得到抓拍图像,更有利于提升抓拍图像的图像质量。m-1帧第一图像的HDR参数满足预设HDR条件,说明m-1帧第一图像拥有很高的亮度动态范围以及更加丰富的色彩,也可以理解为m-1帧第一图像的画质较高。因此,采用画质较高的第一图像得到抓拍图像,更有利于提升抓拍图像的图像质量。
在第一方面的另一种可能的设计方式中,上述第一图像的附加信息包括第一图像的对比度,第一图像的对比度用于表征第一图像的清晰度。该第二图像为:第一缓存队列缓存的n帧第一图像中,对比度最高的第一图像。
其中,第一图像的对比度越高,则该第一图像的清晰度越高。在该设计方式中,电子设备可以从n帧第一图像中选择清晰度最高的图像作为第二图像(即备选的抓拍图像)。这样,有利于提升抓拍图像的图像质量。
在第一方面的另一种可能的设计方式中,上述第一图像的附加信息包括摄像头采集第一图像时的角速度,角速度用于表征摄像头采集第一图像时的抖动情况。上述第二图像为:第一缓存队列缓存的n帧第一图像中,角速度最小的第一图像。
其中,摄像头采集第一图像时的角速度越大,则摄像头采集该第一图像时抖动越大;摄像头采集第一图像时的角速度越小,则摄像头采集该第一图像时抖动越小。应理解,摄像头采集第一图像时抖动越小,则该第一图像的画质越清晰;摄像头采集第一图像时抖动越大,则该第一图像的画质越模糊。在该设计方式中,电子设备可以从n帧第一图像中选择抖动最小的图像作为第二图像(即备选的抓拍图像)。这样,有利于提升抓拍图像的图像质量。
在第一方面的另一种可能的设计方式中,上述第一图像的附加信息还包括第一图像的时间戳;每一帧第一图像中包括时间戳,时间戳记录有电子设备的图像传感器输 出对应第一图像的时间。其中,电子设备中上层应用的时钟与图像传感器记录第一图像出图的时钟同步;或者,电子设备中上层应用的时钟与图像传感器记录第一图像出图的时钟为同一系统时钟。第二图像为:第一缓存队列缓存的n帧第一图像中,时间戳记录的时间与电子设备接收到第二操作的时间最近的第一图像。
可以理解的是,如果一个第一图像的时间戳记录的时间与电子设备接收到第二操作的时间最近,则表示该第一图像是用户想要抓拍的图像的可能性越高。因此,电子设备将时间戳记录的时间与电子设备接收到第二操作的时间最近的第一图像作为第二图像,有利于抓拍到用户真实想要的图像内容。
在第一方面的另一种可能的设计方式中,该方法还包括:电子设备采集电子设备的当前温度。其中,若电子设备的当前温度大于预设温度阈值,则上述m帧第一图像仅包括第二图像,m=1。若电子设备的当前温蒂小于或等于预设温度阈值,则m≥2。应理解,若手机的当前温度大于预设温度阈值时,说明手机的当前温度较高,如果继续高负荷运行可能会影响手机的性能。此时手机可以启用热逃生机制,具体为进行第一图像处理的时候只使用一帧图像(即第二图像),能够降低手机的运行负荷,节省手机的功耗。若手机的当前温度小于或等于该预设温度阈值时,说明手机的当前温度不是很高。此时手机可以基于多帧第一图像得到抓拍图像,能够提升抓拍图像的图像质量。
第二方面,本申请提供一种电子设备,该电子设备包括:触摸屏、存储器、摄像头、显示屏和一个或多个处理器。该触摸屏、存储器、摄像头、显示屏与处理器耦合。其中,存储器中存储有计算机程序代码,计算机程序代码包括计算机指令,当计算机指令被处理器执行时,使得电子设备执行如第一方面及其任一种可能的设计方式所述的方法。
第三方面,本申请提供一种电子设备,该电子设备包括:触摸屏、存储器、摄像头和一个或多个处理器。该触摸屏、存储器、摄像头与处理器耦合。其中,存储器中存储有计算机程序代码,计算机程序代码包括计算机指令,当计算机指令被处理器执行时,使得电子设备执行如下步骤:接收用户的第一操作;其中,第一操作用于触发电子设备开始录制视频;响应于第一操作,摄像头采集第一图像,显示屏显示第一界面;其中,第一界面是电子设备正在录制视频的取景界面,取景界面显示预览流,预览流包括由第一图像得到的预览图像,第一界面还包括抓拍快门,抓拍快门用于触发电子设备抓拍图像得到照片;在第一缓存队列缓存摄像头采集的第一图像;其中,第一缓存队列缓存摄像头采集的n帧第一图像,n≥1,n为整数;响应于用户对抓拍快门的第二操作,根据第一图像的附加信息,从第一缓存队列缓存的n帧第一图像中选择出第二图像;其中,第一图像的附加信息包括第一图像的对比度、摄像头采集第一图像时的角速度和第一图像的时间戳中的至少一个;对包括第二图像的m帧第一图像进行第一图像处理,得到抓拍图像;其中,第一图像处理包括按照目标预览图像的裁剪方式和裁剪参数的裁剪处理,目标预览图像是所述预览流中、电子设备接收到第二操作时摄像头采集的一帧图像m≥1,m为整数;第一图像处理具备提升图像画质的功能。
结合第三方面,在一种可能的设计方式中,上述目标预览图像的裁剪方式包括中 心裁剪方式,目标预览图像的裁剪参数包括目标预览图像的裁剪区域的中心点坐标和裁剪的尺寸信息。
结合第三方面,在另一种可能的设计方式中,m≥2。当计算机指令被处理器执行时,使得电子设备还执行如下步骤:对m帧第一图像进行图像融合,得到第三图像;按照目标预览图像的裁剪方式和裁剪参数,对第三图像进行裁剪处理,得到第四图像;对第四图像进行第二图像处理,得到抓拍图像;其中,第二图像处理包括:图片降噪、亮度及验收校正和图片美化处理中的至少一种。
结合第三方面,在另一种可能的设计中,当计算机指令被处理器执行时,使得电子设备还可以执行如下步骤:获取目标预览图像的逻辑标识,目标预览图像的逻辑标识用于标识采集目标预览图像的摄像头;根据目标预览图像的逻辑标识,从n帧第一图像中确定出包括所述第二图像的所述m帧第一图像;其中,m帧第一图像的逻辑标识与目标预览图像的逻辑标识相同。
结合第三方面,在另一种可能的设计中,m帧第一图像为n帧第一图像中包括第二图像的连续m帧图像;或者,m帧第一图像包括第二图像,以及n帧第一图像中、分辨率大于预设分辨率阈值的m-1帧第一图像;或者,m帧第一图像包括第二图像,以及n帧第一图像中、HDR参数满足预设HDR条件的m-1帧第一图像。
结合第三方面,在另一种可能的设计方式中,第一图像的附加信息包括第一图像的对比度,第一图像的对比度用于表征第一图像的清晰度。第二图像为:第一缓存队列缓存的n帧第一图像中,对比度最高的第一图像。
结合第三方面,在另一种可能的设计方式中,第一图像的附加信息包括摄像头采集第一图像时的角速度,角速度用于表征摄像头采集第一图像时的抖动情况。第二图像为:第一缓存队列缓存的n帧第一图像中,角速度最小的第一图像。
结合第三方面,在另一种可能的设计方式中,第一图像的附加信息还包括第一图像的时间戳;每一帧第一图像中包括时间戳,时间戳记录有电子设备的图像传感器输出对应第一图像的时间。其中,电子设备中上层应用的时钟与图像传感器记录第一图像出图的时钟同步;或者,电子设备中上层应用的时钟与图像传感器记录第一图像出图的时钟为同一系统时钟。
第二图像为:第一缓存队列缓存的n帧第一图像中,时间戳记录的时间与电子设备接收到第二操作的时间最近的第一图像。
结合第三方面,在另一种可能的设计方式中,当计算机指令被处理器执行时,使得电子设备还可以执行如下步骤:采集电子设备的当前温度;其中,若电子设备的当前温度大于预设温度阈值,则m帧第一图像仅包括第二图像,m=1;若电子设备的当前温度小于或等于预设温度阈值,则m≥2。
第四方面,本申请提供一种计算机可读存储介质,该计算机可读存储介质包括计算机指令,当计算机指令在电子设备上运行时,使得电子设备执行如第一方面及其任一种可能的设计方式所述的方法。
第五方面,本申请提供一种计算机程序产品,当该计算机程序产品在计算机上运行时,使得该计算机执行如第一方面及任一种可能的设计方式所述的方法。该计算机 可以是上述电子设备。
可以理解地,上述提供的第二方面和第三方面及其任一种可能的设计方式所述的电子设备,第四方面所述的计算机存储介质,第五方面所述的计算机程序产品所能达到的有益效果,可参考第一方面及其任一种可能的设计方式中的有益效果,此处不再赘述。
附图说明
图1为Sensor输出图像,ISP和ENCODE处理图像得到高清图像的处理流程图;
图2为本申请实施例提供的一种手机的录像取景界面示意图;
图3为本申请实施例提供的一种手机接收抓拍操作到Sensor接收到抓拍指示的延迟时长示意图;
图4A为本申请实施例提供的一种手机从视频流中截取一帧图像作为抓拍图像的方法原理框图;
图4B为本申请实施例提供的一种手机将用户在拍摄过程中得到的拍摄图像作为抓拍图像的方法原理框图;
图4C为本申请实施例提供的一种录像中抓拍图像的方法原理框图;
图4D为本申请实施例提供的另一种录像中抓拍图像的方法原理框图;
图5A为本申请实施例提供的一种电子设备500的结构示意图;
图5B为本申请实施例提供的一种手机的软件架构示意图;
图6为本申请实施例提供的一种录像中抓拍图像的方法流程图;
图7为本申请实施例提供的一种手机的显示界面示意图;
图8为本申请实施例提供的一种第一缓存队列的示意图;
图9为本申请实施例提供的另一种手机的显示界面示意图;
图10为本申请实施例提供的另一种手机的显示界面示意图;
图11为本申请实施例提供的另一种录像中抓拍图像的方法流程图;
图12为本申请实施例提供的另一种录像中抓拍图像的方法原理框图;
图13为本申请实施例提供的另一种录像中抓拍图像的方法流程图;
图14为本申请实施例提供的一种芯片系统的结构示意图。
具体实施方式
为了使本领域普通人员更好地理解本申请的技术方案,下面将结合附图,对本申请实施例中的技术方案进行清楚、完整地描述。
需要说明的是,本申请的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换,以便这里描述的本申请的实施例能够以除了在这里图示或描述的那些以外的顺序实施。以下示例性实施例中所描述的实施方式并不代表与本申请相一致的所有实施方式。相反,它们仅是与如所附权利要求书中所详述的、本申请的一些方面相一致的装置和方法的例子。
还应当理解的是,术语“包括”指示所描述特征、整体、步骤、操作、元素和/或组件的存在,但并不排除一个或多个其他特征、整体、步骤、操作、元素和/或组件的存在或添加。在本实施例的描述中,除非另有说明,“多个”的含义是两个或两个 以上。
目前,手机录像过程中,手机的图像传感器(Sensor)受到曝光的控制,可以不断输出拜耳(Bayer)图像。每一帧Bayer图像经过手机的图像信号处理器(image signal processor,ISP)处理,然后经过编码器(ENCODER)进行编码,便可以得到视频流(如预览流或录像流)。
请参考图1,其示出手机录像过程中图像传感器(Sensor)输出图像后,手机中预览流和录像流的处理流程。其中,预览流是指手机录像过程中在显示屏上呈现给用户的视频流,录像流是指录像结束后保存在手机中可供用户查看的视频流。
如图1所示,图像传感器输出图像后,可以由ISP对图像进行图像处理;在ISP的图像处理后,可以分为两路数据流。一路数据流采用图1所示的处理算法1进行处理,然后经过编码器1可编码得到预览流。另一路数据流采用图1所示的处理算法2进行处理,然后经过编码器2可编码得到录像流。
上述处理算法1也可以称为预览流的后处理算法,处理算法2也可以称为录像流的后处理算法。处理算法1和处理算法2可以包括防抖处理、去噪处理、虚化处理、色彩和亮度调整等处理功能。
需要说明的是,手机在录像过程中处理预览流和录像流的方式包括但不限于图1所示的方式。例如,图像传感器输出图像后,可以由ISP对图像进行一部分图像处理(如“RAW域”和“RGB域”的图像处理)。然后,可以分为两路数据流;一路数据流采用处理算法1进行处理,然后由ISP进行另一部分图像处理(如“YUV域”的图像处理),再经过编码器1可编码得到预览流。另一路数据流采用处理算法2进行处理,然后由ISP进行另一部分图像处理(如“YUV域”的图像处理),再经过编码器2可编码得到录像流。以下实施例中,以图1所示的预览流和录像流的处理方式为例,介绍本申请实施例的方法。
应注意,由于Sensor输出图像,ISP和编码器(即ENCODER,如编码器1和编码器2)处理图像都是为了录制视频;因此,可以将Sensor输出图像、ISP和编码器(ENCODER)处理图像的整个过程中的数据流(如预览流和录像流)称为视频流。
手机录像的过程中,手机可以响应于用户的操作抓拍图像。例如,手机可以显示图2所示的录像的取景界面201。该录像的取景界面201包括抓拍快门202,该抓拍快门202用于触发手机抓拍录像过程中的图像并保存成照片。手机响应于用户对图2所示的抓拍快门202的点击操作,便可以抓拍图像。其中,用户希望手机抓拍的是用户点击抓拍快门202那一瞬间,摄像头采集的图像。
为了实现手机录像中抓拍图像,一些技术方案中,可以选取手机的抓拍(Snapshot)程序接收到抓拍指令时,采集的第1帧图像作为抓拍图像(如图3所示的第7帧图像)。但是,上层应用(如图2所示的录像的取景界面201对应的相机应用)接收到用户的抓拍操作(如用户对抓拍快门202的点击操作)后,向Snapshot程序传输抓拍指令需要时间(如图3所示的延迟时长)。在这段时间(如图3所示的延迟时长)内,Sensor并不会停止输出Bayer图像。所以,从上层应用接收到用户的抓拍操作,到Snapshot程序接收到抓拍指令,Sensor可能已经输出了多帧Bayer图像。
例如,如图3所示,假设图像传感器(Sensor)输出第3帧Bayer图像时,上层 应用接收到抓拍操作;Sensor输出第7帧Bayer图像时,抓拍指令传递到Snapshot程序。如此,采用现有技术的方案,因为图3所示的延迟时长,所以第7帧图像并不是用户点击抓拍快门202瞬间的一帧图像。采用该方案,并不能抓拍到用户真实想要的一帧图像。需要说明的是,图3所示的8帧图像中,第1帧图像是Sensor最早出帧的一帧图像,而第8帧图像是Sensor最晚出帧的一帧图像。图像传感器(Sensor)可以从第1帧图像开始,依次曝光输出图3所示的8帧图像。
在另一些实施例中,手机可以截取视频流(如预览流或录像流)中用户抓拍瞬间采集的一帧图像,作为抓拍图像保存成照片展示给用户。
但是,手机录像过程中,每秒需要处理大量图像(如30帧图像)。如此,留给每一帧图像的运算资源和时间都是有限的;因此,手机一般可以使用ISP的硬件处理模块,采用较为简单的处理方式来处理视频流;而不会使用复杂的算法来提升画质(如去噪和提亮)。这样的图像处理效果,只能满足视频的要求;而拍照对画质的要求则更高。因此,截取视频流中的图像,并不能抓拍到用户满意的图像。
例如,如图4A所示,手机的图像传感器(Sensor)输出的每一帧Bayer图像经过手机的图像前端引擎、防抖模块、图像处理引擎以及色彩处理模块,便可以得到预览流。具体的,图像前端引擎可以为图1、图4C以及图4D中的ISP,该图像前端引擎用于对每一帧Bayer图像进行一部分图像处理(如“RAW域”和“RGB域”的图像处理)。防抖模块、图像处理引擎以及色彩处理模块可以为图1、图4C以及图4D中的处理算法1所执行的图像处理对应的功能模块。其中,防抖模块为防抖处理对应的功能模块,用于对每一帧Bayer图像进行防抖处理。图像处理引擎可以为去噪处理对应的功能模块,用于对每一帧Bayer图像进行去噪处理;色彩处理模块为色彩和亮度调整对应的功能模块,用于对每一帧Bayer图像进行色彩和亮度调节等处理。手机可以截取预览流中用户抓拍瞬间采集的一帧图像,作为抓拍图像保存成照片展示给用户。
在另一些实施例中,手机还可以将用户在拍摄过程中得到的拍摄图像,作为抓拍图像保存成照片展示给用户。
但是,用户在拍摄过程中得到的拍摄图像是基于手机(具体为上层应用)中包括的拍摄功能实现的,并非是用户在录像过程中抓拍得到的图像,无法有效展现用户在录像过程中的每一个细节,并不能抓拍到用户满意的图像。
例如,如图4B所示,手机的图像传感器(Sensor)输出的每一帧Bayer图像经过手机的图像处理模块,可以生成拍摄图像。手机可以将该拍摄图像作为抓拍图像保存成照片展示给用户。具体的,图像处理模块可以包括防抖处理、去噪处理、虚化处理、色彩和亮度调整等处理功能,即该图像处理模块用于对每一帧Bayer图像进行防抖处理、去噪处理、虚化处理、色彩和亮度调整等。
本申请实施例提供一种录像中抓拍图像的方法,可以在录像过程中抓拍图像,并且可以提升抓拍图像的图像质量。
一方面,本申请实施例中,如图4C所示,电子设备(如手机)可以将Sensor曝光输出第一图像(即Bayer图像)缓存在一个第一缓存队列(Buffer)中。该Buffer可以缓存多帧第一图像(即Bayer图像)。如此,即使从接收到用户的抓拍操作到Snapshot程序接收到抓拍指令,存在图3所示的延迟时长(如120ms-160ms);在这 段延迟时长内Sensor出帧都可以缓存在Buffer中。因此,手机接收到用户的抓拍操作时,Sensor输出的Bayer图像也可以缓存在第一缓存队列中。并且,短时间内Sensor出帧的图像内容不会发生太大变化;如此,如图4C所示,响应于Snapshot程序接收到抓拍指令,可以由手机的选帧模块从Buffer中选择出图像质量较好的一帧图像(即第二图像)用来生成抓拍图像。这样,可以提升抓拍图像的图像质量。
另一方面,如图4C所示,电子设备还可以对包括第二图像的m帧第一图像进行第一图像处理得到抓拍图像。由于该第一图像处理包括按照目标预览图像的裁剪方式和裁剪参数的裁剪处理;因此电子设备可以按照目标预览图像的裁剪方式和裁剪参数对包括第二图像的m帧第一图像进行裁剪处理,能够得到与目标预览图像的视场角FOV相同的抓拍图像,可以提升抓拍图像的图像质量。
综上所述,采用本申请实施例的方法,可以在录像过程中抓拍到满足用户需求的图像,并且可以提升抓拍图像的图像质量。
另外,如图4D所示,上述第一图像处理还可以包括预设RAW域AI图像增强算法模型(简称为预设RAW域图像处理算法)所执行的图像处理。电子设备可以采用预设RAW域图像处理算法处理选帧模块选择的抓拍帧;最后,由编码器3对处理结果进行编码得到抓拍流。
其中,预设RAW域图像处理算法是一个RAW域的画质增强的深度学习网络。本方案中,采用预设RAW域图像处理算法可以提升抓拍帧的画质。即采用本申请实施例的方法,可以在录像过程中抓拍到满足用户需求的图像,并且可以提升抓拍图像的图像质量。
其中,预设RAW域图像处理算法是一个RAW域的画质增强的深度学习网络。该预设RAW域图像处理算法也可以称为预设画质增强算法、预设画质增强算法模型或者预设RAW域AI模型。
在一些实施例中,预设RAW域图像处理算法可以是软件图像处理算法。该预设RAW域图像处理算法可以是手机的硬件抽象层(hardware abstraction layer,HAL)算法库中的一种软件算法。
在另一些实施例中,预设RAW域图像处理算法可以是硬件图像处理算法。该预设RAW域图像处理算法可以是调用ISP的图像处理算法能力实现的一种硬件图像处理算法。
需要说明的是,预设RAW域图像处理算法也可以称为预设图像处理算法。本申请实施例中之所以称之为预设RAW域图像处理算法,是因为该预设RAW域图像处理算法输入的是RAW域的图像。该预设RAW域图像处理算法输出的可以是RAW域的图像,也可以是RGB域的图像,本申请实施例对此不作限制。
上述编码器1、编码器2和编码器3可以是三个不同的编码器。手机可以采用三个不同的编码器分别对上述预览流、录像流和抓拍流进行编码。或者,上述编码器1、编码器2和编码器3可以是同一个编码器。一个编码器可以包括多个编码单元。手机可以采用一个编码器中三个不同的编码单元分别对上述预览流、录像流和抓拍流进行编码。或者,编码器1和编码器2可以是同一个编码器中不同的两个编码单元,编码器3可以是另一个编码器。
其中,不同编码器的编码方式可以相同,也可以不同。同一编码器的不同编码单元的编码方式可以相同,也可以不同。因此,上述显示模组中的编码器和编码器1输出的图像格式可以相同,也可以不同。例如,显示模组中的编码器和编码器1输出的图像可以是联合图像专家组(Joint Photographic Experts Group,JPEG)、标签图像文件格式(Tag Image File Format,TIFF)等任一种格式的图像。
图1、图4A、图4B、图4C或图4D所示的图像传感器(Sensor)输出的图像为拜耳(Bayer)格式的图像(简称Bayer图像)。其中,Bayer、JPEG和TIFF是图像的三种表达格式。Bayer图像和JPEG图像的详细介绍可以参考常规技术中的相关内容,这里不予赘述。
示例性的,本申请实施例中的电子设备可以是手机、平板电脑、智能手表、桌面型、膝上型、手持计算机、笔记本电脑、超级移动个人计算机(ultra-mobile personal computer,UMPC)、上网本,以及蜂窝电话、个人数字助理(personal digital assistant,PDA)、增强现实(augmented reality,AR)\虚拟现实(virtual reality,VR)设备等包括摄像头的设备,本申请实施例对该电子设备的具体形态不作特殊限制。
下面将结合附图对本申请实施例的实施方式进行详细描述。请参考图5A,为本申请实施例提供的一种电子设备500的结构示意图。如图5A所示,电子设备500可以包括:处理器510,外部存储器接口520,内部存储器521,通用串行总线(universal serial bus,USB)接口530,充电管理模块540,电源管理模块541,电池542,天线1,天线2,移动通信模块550,无线通信模块560,音频模块570,扬声器570A,受话器570B,麦克风570C,耳机接口570D,传感器模块580,按键590,马达591,指示器592,摄像头593,显示屏594,以及用户标识模块(subscriber identification module,SIM)卡接口595等。
其中,上述传感器模块580可以包括压力传感器,陀螺仪传感器,气压传感器,磁传感器,加速度传感器,距离传感器,接近光传感器,指纹传感器,温度传感器,触摸传感器,环境光传感器和骨传导传感器等传感器。
可以理解的是,本实施例示意的结构并不构成对电子设备500的具体限定。在另一些实施例中,电子设备500可以包括比图示更多或更少的部件,或者组合某些部件,或者拆分某些部件,或者不同的部件布置。图示的部件可以以硬件,软件或软件和硬件的组合实现。
处理器510可以包括一个或多个处理单元,例如:处理器510可以包括应用处理器(application processor,AP),调制解调处理器,GPU,图像信号处理器(image signal processor,ISP),控制器,存储器,视频编解码器,数字信号处理器(digital signal processor,DSP),基带处理器,和/或NPU等。其中,不同的处理单元可以是独立的器件,也可以集成在一个或多个处理器中。
控制器可以是电子设备500的神经中枢和指挥中心。控制器可以根据指令操作码和时序信号,产生操作控制信号,完成取指令和执行指令的控制。
处理器510中还可以设置存储器,用于存储指令和数据。在一些实施例中,处理器510中的存储器为高速缓冲存储器。该存储器可以保存处理器510刚用过或循环使用的指令或数据。如果处理器510需要再次使用该指令或数据,可从所述存储器中直 接调用。避免了重复存取,减少了处理器510的等待时间,因而提高了系统的效率。
在一些实施例中,处理器510可以包括一个或多个接口。可以理解的是,本实施例示意的各模块间的接口连接关系,只是示意性说明,并不构成对电子设备500的结构限定。在另一些实施例中,电子设备500也可以采用上述实施例中不同的接口连接方式,或多种接口连接方式的组合。
充电管理模块540用于从充电器接收充电输入。充电管理模块540为电池542充电的同时,还可以通过电源管理模块541为电子设备供电。
电源管理模块541用于连接电池542、充电管理模块540与处理器510。电源管理模块541接收电池542和/或充电管理模块540的输入,为处理器510,内部存储器521,外部存储器,显示屏594,摄像头593,和无线通信模块560等供电。
电子设备500的无线通信功能可以通过天线1,天线2,移动通信模块550,无线通信模块560,调制解调处理器以及基带处理器等实现。
天线1和天线2用于发射和接收电磁波信号。在一些实施例中,电子设备500的天线1和移动通信模块550耦合,天线2和无线通信模块560耦合,使得电子设备500可以通过无线通信技术与网络以及其他设备通信。
电子设备500通过GPU,显示屏594,以及应用处理器等实现显示功能。GPU为图像处理的微处理器,连接显示屏594和应用处理器。GPU用于执行数学和几何计算,用于图形渲染。处理器510可包括一个或多个GPU,其执行程序指令以生成或改变显示信息。
显示屏594用于显示图像,视频等。该显示屏594包括显示面板。显示面板可以采用液晶显示屏(liquid crystal display,LCD),有机发光二极管(organic light-emitting diode,OLED),有源矩阵有机发光二极体或主动矩阵有机发光二极体(active-matrix organic light emitting diode,AMOLED),柔性发光二极管(flex light-emitting diode,FLED),Miniled,MicroLed,Micro-oLed,量子点发光二极管(quantum dot light emitting diodes,QLED)等。
电子设备500可以通过ISP,摄像头593,视频编解码器,GPU,显示屏594以及应用处理器等实现拍摄功能。
ISP用于处理摄像头593反馈的数据。例如,拍照时,打开快门,光线通过镜头被传递到摄像头感光元件上,光信号转换为电信号,摄像头感光元件将所述电信号传递给ISP处理,转化为肉眼可见的图像。ISP还可以对图像的噪点,亮度,肤色进行算法优化。ISP还可以对拍摄场景的曝光,色温等参数优化。在一些实施例中,ISP可以设置在摄像头593中。
摄像头593用于捕获静态图像或视频。物体通过镜头生成光学图像投射到感光元件。感光元件可以是电荷耦合器件(charge coupled device,CCD)或互补金属氧化物半导体(complementary metal-oxide-semiconductor,CMOS)光电晶体管。感光元件把光信号转换成电信号,之后将电信号传递给ISP转换成数字图像信号。ISP将数字图像信号输出到DSP加工处理。DSP将数字图像信号转换成标准的RGB,YUV等格式的图像信号。在一些实施例中,电子设备500可以包括N个摄像头593,N为大于1的正整数。
数字信号处理器用于处理数字信号,除了可以处理数字图像信号,还可以处理其他数字信号。例如,当电子设备500在频点选择时,数字信号处理器用于对频点能量进行傅里叶变换等。
视频编解码器用于对数字视频压缩或解压缩。电子设备500可以支持一种或多种视频编解码器。这样,电子设备500可以播放或录制多种编码格式的视频,例如:动态图像专家组(moving picture experts group,MPEG)1,MPEG2,MPEG3,MPEG4等。
NPU为神经网络(neural-network,NN)计算处理器,通过借鉴生物神经网络结构,例如借鉴人脑神经元之间传递模式,对输入信息快速处理,还可以不断的自学习。通过NPU可以实现电子设备500的智能认知等应用,例如:图像识别,人脸识别,语音识别,文本理解等。
外部存储器接口520可以用于连接外部存储卡,例如Micro SD卡,实现扩展电子设备500的存储能力。外部存储卡通过外部存储器接口520与处理器510通信,实现数据存储功能。例如将音乐,视频等文件保存在外部存储卡中。
内部存储器521可以用于存储计算机可执行程序代码,所述可执行程序代码包括指令。处理器510通过运行存储在内部存储器521的指令,从而执行电子设备500的各种功能应用以及数据处理。例如,在本申请实施例中,处理器510可以通过执行存储在内部存储器521中的指令,内部存储器521可以包括存储程序区和存储数据区。
其中,存储程序区可存储操作系统,至少一个功能所需的应用程序(比如声音播放功能,图像播放功能等)等。存储数据区可存储电子设备500使用过程中所创建的数据(比如音频数据,电话本等)等。此外,内部存储器521可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件,闪存器件,通用闪存存储器(universal flash storage,UFS)等。
电子设备500可以通过音频模块570,扬声器570A,受话器570B,麦克风570C,耳机接口570D,以及应用处理器等实现音频功能。例如音乐播放,录音等。
按键590包括开机键,音量键等。马达591可以产生振动提示。指示器592可以是指示灯,可以用于指示充电状态,电量变化,也可以用于指示消息,未接来电,通知等。
SIM卡接口595用于连接SIM卡。SIM卡可以通过插入SIM卡接口595,或从SIM卡接口595拔出,实现和电子设备500的接触和分离。电子设备500可以支持1个或N个SIM卡接口,N为大于1的正整数。SIM卡接口595可以支持Nano SIM卡,Micro SIM卡,SIM卡等。
以下实施例中的方法均可以在具有上述硬件结构的电子设备500中实现。以下实施例中,以电子设备500是手机为例,介绍本申请实施例的方法。图5B是本申请实施例的手机的软件结构框图。
分层架构将软件分成若干个层,每一层都有清晰的角色和分工。层与层之间通过软件接口通信。在一些实施例中,将AndroidTM系统分为五层,从上至下分别为应用程序层,应用程序框架层,安卓运行时(Android runtime)和系统库,硬件抽象层(hardware abstraction layer,HAL)以及内核层。应理解:本文以Android系统举例来说明,在其他操作系统中(例如鸿蒙TM系统,IOSTM系统等),只要各个功能模块 实现的功能和本申请的实施例类似也能实现本申请的方案。
应用程序层可以包括一系列应用程序包。
如图5B所示,应用程序层中可以安装通话,游戏,相机,导航,浏览器,日历,地图,蓝牙,音乐,视频等应用。
在本申请实施例中,应用程序层中可以安装具有拍摄功能的应用,例如,相机应用。当然,其他应用需要使用拍摄功能时,也可以调用相机应用实现拍摄功能。
应用程序框架层为应用程序层的应用程序提供应用编程接口(application programming interface,API)和编程框架。应用程序框架层包括一些预先定义的函数。
例如,应用程序框架层可以包括窗口管理器,内容提供器,视图系统,资源管理器,通知管理器等,本申请实施例对此不做任何限制。
例如,上述窗口管理器用于管理窗口程序。窗口管理器可以获取显示屏大小,判断是否有状态栏,锁定屏幕,截取屏幕等。上述内容提供器用来存放和获取数据,并使这些数据可以被应用程序访问。所述数据可以包括视频,图像,音频,拨打和接听的电话,浏览历史和书签,电话簿等。上述视图系统可用于构建应用程序的显示界面。每个显示界面可以由一个或多个控件组成。一般而言,控件可以包括图标、按钮、菜单、选项卡、文本框、对话框、状态栏、导航栏、微件(Widget)等界面元素。上述资源管理器为应用程序提供各种资源,比如本地化字符串,图标,图片,布局文件,视频文件等等。上述通知管理器使应用程序可以在状态栏中显示通知信息,可以用于传达告知类型的消息,可以短暂停留后自动消失,无需用户交互。比如通知管理器被用于告知下载完成,消息提醒等。通知管理器还可以是以图表或者滚动条文本形式出现在系统顶部状态栏的通知,例如后台运行的应用程序的通知,还可以是以对话窗口形式出现在屏幕上的通知。例如在状态栏提示文本信息,发出提示音,振动,指示灯闪烁等。
如图5B所示,Android runtime包括核心库和虚拟机。Android runtime负责安卓系统的调度和管理。
核心库包含两部分:一部分是java语言需要调用的功能函数,另一部分是安卓的核心库。
应用程序层和应用程序框架层运行在虚拟机中。虚拟机将应用程序层和应用程序框架层的java文件执行为二进制文件。虚拟机用于执行对象生命周期的管理,堆栈管理,线程管理,安全和异常的管理,以及垃圾回收等功能。
系统库可以包括多个功能模块。例如:表面管理器(surface manager),媒体库(Media Libraries),三维图形处理库(例如:OpenGL ES),2D图形引擎(例如:SGL)等。
其中,表面管理器用于对显示子系统进行管理,并且为多个应用程序提供了2D和3D图层的融合。媒体库支持多种常用的音频,视频格式回放和录制,以及静态图像文件等。媒体库可以支持多种音视频编码格式,例如:MPEG4,H.254,MP3,AAC,AMR,JPG,PNG等。三维图形处理库用于实现三维图形绘图,图像渲染,合成,和图层处理等。2D图形引擎是2D绘图的绘图引擎。
内核层位于HAL之下,是硬件和软件之间的层。内核层至少包含显示驱动,摄像头驱动,音频驱动,传感器驱动等,本申请实施例对此不做任何限制。
在本申请实施例中,仍如图5B所示,以相机应用举例,可应用程序框架层中设置有相机服务(Camera Service)。相机应用可通过调用预设的API启动Camera Service。Camera Service在运行过程中可以与硬件抽象层(HAL)中的Camera HAL交互。其中,Camera HAL负责与手机中实现拍摄功能的硬件设备(例如摄像头)进行交互,Camera HAL一方面隐藏了相关硬件设备的实现细节(例如具体的图像处理算法),另一方面可向Android系统提供调用相关硬件设备的接口。
示例性的,相机应用运行时可将用户下发的相关控制命令(例如预览、放大、拍照、录像或者抓拍指令)发送至Camera Service。一方面,Camera Service可将接收到的控制命令发送至Camera HAL,使得Camera HAL可根据接收到的控制命令调用内核层中的相机驱动,由相机驱动来驱动摄像头等硬件设备响应该控制命令采集图像数据。例如,摄像头可按照一定的帧率,将采集到的每一帧图像数据通过相机驱动传递给Camera HAL。其中,控制命令在操作系统内部的传递过程可参见图5B中控制流的具体传递过程。
另一方面,Camera Service接收到上述控制命令后,可根据接收到的控制命令确定此时的拍摄策略,拍摄策略中设置了需要对采集到的图像数据执行的具体图像处理任务。例如,在预览模式下,Camera Service可在拍摄策略中设置图像处理任务1用于实现人脸检测功能。又例如,如果在预览模式下用户开启了美颜功能,则Camera Service还可以在拍摄策略中设置图像处理任务2用于实现美颜功能。进而,Camera Service可将确定出的拍摄策略发送至Camera HAL。
当Camera HAL接收到摄像头采集到的每一帧图像数据后,可根据Camera Service下发的拍摄策略对上述图像数据执行相应的图像处理任务,得到图像处理后的每一帧拍摄画面。例如,Camera HAL可根据拍摄策略1对接收到的每一帧图像数据执行图像处理任务1,得到对应的每一帧拍摄画面。当拍摄策略1更新为拍摄策略2后,Camera HAL可根据拍摄策略2对接收到的每一帧图像数据执行图像处理任务2,得到对应的每一帧拍摄画面。
后续,Camera HAL可将经过图像处理后的每一帧拍摄画面通过Camera Service上报给相机应用,相机应用可将每一帧拍摄画面显示在显示界面中,或者,相机应用以照片或视频的形式将每一帧拍摄画面保存在手机内。其中,上述拍摄画面在操作系统内部的传递过程可参见图5B中数据流的具体传递过程。
本申请实施例这里结合图5B介绍手机中各个软件层实现本申请实施例的方法的工作原理。
相机应用在录像模式下运行时,可将用户下发的抓拍指令发送至Camera Service。在录像模式下,Camera HAL可根据之前接收到的录像指令调用内核层中的相机驱动,由相机驱动来驱动摄像头等硬件设备响应该录像指令采集图像数据。例如,摄像头可按照一定的帧率,将采集到的每一帧图像数据通过相机驱动传递给Camera HAL。其中,基于录像指令由相机驱动传递给Camera HAL的每一帧图像组成的数据流可以为本申请实施例中所述的视频流(如预览流和录像流)。
另外,Camera Service接收到上述抓拍指令后,可根据接收到的抓拍指令确定此时的拍摄策略3为录像中抓拍图像。该拍摄策略中设置了需要对采集到的图像数据执行的具体图像处理任务3,该图像处理任务3中包括按照目标预览图像的裁剪方式和裁剪参数的裁剪处理,该图像处理任务3用于实现录像中抓拍功能。进而,Camera Service可将确定出的拍摄策略3发送至Camera HAL。
当Camera HAL接收到摄像头采集到的每一帧图像数据后,可根据Camera Service下发的拍摄策略3对上述图像数据执行相应的图像处理任务3,如按照目标预览图像的裁剪方式和裁剪参数对包括第二图像的m帧第一图像进行裁剪处理,得到对应的抓拍图像。
应注意,本申请实施例中,摄像头的图像传感器(Sensor)曝光输出的每一帧图像(即第一图像)可以缓存在第一缓存队列(Buffer)中。其中,第一缓存队列(Buffer)可以设置在手机软件系统的任何一层,如第一缓存队列(Buffer)可以设置在Camera HAL通过软件接口访问的内存区域。Camera HAL响应于抓拍指令,可以根据Buffer中缓存的多帧Bayer图像(即第一图像)的元数据,从该Buffer中选择出抓拍帧。如此,手机则可以从第一缓存队列中得到图像质量较高的抓拍帧。示例性的,第一图像的附加信息可以包括第一图像的对比度和摄像头采集第一图像时的角速度。应理解,角速度越小,摄像头采集第一图像时抖动越小;角速度越大,摄像头采集第一图像时抖动越大。对比度用于表征第一图像的清晰度。对比度越高,第一图像越清晰。如此,根据第一图像的附加信息,则可以从Buffer中缓存的多帧Bayer图像(即第一图像)中选择出抖动小,且图像清晰度最大的Bayer图像作为抓拍帧。
其中,Buffer中缓存的每一帧Bayer图像的附加信息可以是硬件层的摄像头的ISP为Buffer中的每一帧Bayer图像的元数据(metadata)赋值得到的。其中,ISP按照功能可以分为统计模块和处理模块。统计模块可以包括(image front end,IFE),处理模块可以包括(image processing engine,IPE)和(bayer processing segment,BPS)。上述Bayer图像的附加信息可以是由ISP的统计模块为Buffer中的每一帧Bayer图像的元数据赋值得到的。ISP的处理模块用于处理Sensor曝光输出的图像。
上述角速度可以是电子设备中的陀螺仪采集到的。在本申请实施例中,HAL中保存有用于调度陀螺仪的软件代码。Camera HAL响应于录像指令,可以调用内核层中的陀螺仪驱动,由陀螺仪驱动来驱动陀螺仪采集电子设备的角速度。其中,电子设备的角速度即摄像头的角速度。摄像头在不同时刻的角速度可能不同,摄像头的Sensor在不同时刻可以曝光输出不同的Bayer图像。并且,Camera HAL响应于录像指令,还可以调用内核层中的相机驱动,由相机驱动来驱动摄像头中的ISP的统计模块将陀螺仪采集的角速度写入Sensor输出的Bayer图像的元数据。
其中,Bayer图像的附加信息中还包括Sensor曝光输出该Bayer图像的时间。ISP的统计模块可以根据角速度的采集时间,以及Bayer图像的曝光时间,确定该Bayer图像的角速度,并将该Bayer图像的角速度写入该Bayer图像的元数据中。并且,ISP的统计模块还可以分析该Bayer图像,得到该Bayer图像的对比度,并将该Bayer图像的对比度写入该Bayer图像的元数据中。
需要说明的是,在一些平台,可以将Sensor曝光结束时间作为时间戳;在另一些 平台可以将Sensor开始曝光时间作为时间戳,本申请实施例对此不作限制。其中,上述曝光结束时间和开始曝光时间统称为曝光时间。
其中,ISP的统计模块可以通过一个第一预设接口,如第一摄像头串行接口(camera serial interface,CSI),将Buffer中每个Bayer图像的角速度和对比度写入对应Bayer图像的元数据中。其中,上述第一预设CSI可以是Sensor与Buffer之间的一个软件接口。
其中,Camera HAL还可以通过第二预设接口(如第二预设CSI)对包括第二图像的m帧第一图像进行第一图像处理(包括裁剪处理),得到的抓拍图像。其中,上述第二预设CSI可以为Buffer与执行第一图像处理的模块之间的一个软件接口。然后,Camera HAL可以将抓拍图像通过Camera Service上报给相机应用,相机应用可将抓拍图像以照片的形式保存在手机内。
可选地,Camera HAL中还可以包括预设RAW域图像处理算法。Camera HAL可以通过第二预设接口(如第二预设CSI)调用预设RAW域图像处理算法处理该抓拍帧和该抓拍帧的相邻帧,得到处理后的图像帧。其中,上述第二预设CSI可以是Buffer与预设RAW域图像处理算法之间的一个软件接口。然后,Camera HAL可以调用编码器(ENCODE)对该图像帧进行编码,便可以得到一帧抓拍图像。同理,Camera HAL可将抓拍图像通过Camera Service上报给相机应用,相机应用可将抓拍图像以照片的形式保存在手机内。
在一些实施例中,Camera HAL中还包括一个感知模块。选帧模块选择出抓拍帧后,感知模块可以根据抓拍帧和该抓拍帧的相邻帧,确定手机是否处于高动态(high dynamic range,HDR)场景。其中,在HDR场景与非HDR场景下,预设RAW域图像处理算法执行不同的图像处理流程。HDR场景与非HDR场景下,预设RAW域图像处理算法的图像处理流程,可以参考以下实施例中的详细介绍,这里不予赘述。
本申请实施例提供一种录像中抓拍图像的方法,该方法可以应用于手机,该手机包括摄像头。如图6所示,该方法可以包括S601-S605。
S601、手机接收用户的第一操作。该第一操作用于触发手机开始录制视频。
示例性的,手机可以显示图7所示的录像的取景界面701。该录像的取景界面701是手机还未开始录像的取景界面。该录像的取景界面701包括“开始录像”按钮702。上述第一操作可以是用户对“开始录像”按钮702的点击操作,用于触发手机开始录制视频。
S602、响应于第一操作,手机的摄像头采集第一图像,手机显示第一界面。该第一界面是手机正在录制视频的取景界面,该取景界面显示预览流,该预览流包括由第一图像得到的预览图像。该第一界面还包括抓拍快门,该抓拍快门用于触发手机抓拍图像得到照片。
示例性的,以第一操作是用户对“开始录像”按钮702的点击操作为例。手机响应于用户对“开始录像”按钮702的点击操作,手机的摄像头可以开始采集图像(即第一图像),手机的显示屏可显示图7所示的第一界面703。该第一界面703是手机正在录制视频的取景界面。如图7所示,该第一界面703包括由上述第一图像得到的预览图像704。其中,多帧预览图像704可以组成图1、图4A、图4C或图4D所示的 预览流。
其中,本申请实施例这里介绍手机由第一图像得到预览图像704的方法。在S602中,手机的摄像头采集第一图像之后,手机显示第一界面之前,手机可以按照图1、图4A、图4C或图4D所示的预览流的处理方式处理该第一图像,得到预览图像704。应注意,手机的ISP可以采用ISP处理摄像头采集的每一帧第一图像。
例如,手机由第一图像得到预览图像704的方法,可以参考图4C或图4D所示“预览流”的处理方法。
如图4C或图4D所示,手机的图像传感器(Sensor)受到曝光的控制,可以不断输出Bayer图像。每一帧Bayer图像由手机的ISP进行图像处理后,送至编码器1(ENCODER)进行编码,便可以得到预览图像704。处理后的多帧预览图像704可以形成一段预览的视频流(即预览流)。
需要强调的是,如图7所示,第一界面703还包括抓拍快门702。该抓拍快门702用于触发手机抓拍图像得到照片。具体的,该抓拍快门702用于触发手机在录像的过程中抓拍图像得到照片。可以想到的是,手机录制视频(即录像)的过程中,可能会采集到的一些精彩的画面。在手机录像的过程中,用户可能会希望手机可以抓拍到上述精彩的画面,并保存成照片展示给用户。本申请实施例中,用户点击上述抓拍快门702便可以实现录像过程中抓拍精彩画面的功能。
为了保证手机响应于用户的抓拍操作(如用户对抓拍快门702的点击操作),可以抓拍到用户实际需要的图像;手机可以将Sensor曝光输出Bayer图像缓存在一个第一缓存队列(Buffer)中。如此,即使从接收到用户的抓拍操作到Sensor接收到抓拍指令,存在图3所示的延迟时长(如120ms-160ms);在这段延迟时长内Sensor出帧都可以缓存在Buffer中。因此,手机接收到用户的抓拍操作时,Sensor输出的Bayer图像也可以缓存在第一缓存队列中。并且,短时间内Sensor出帧的图像内容不会发生太大变化;因此,手机可以从Buffer中选择出图像质量较好的一帧图像作为抓拍图像。具体的,响应于上述第一操作,手机还可以执行S603。
S603、手机在第一缓存队列缓存摄像头采集的第一图像。该第一缓存队列缓存摄像头采集的n帧第一图像,n≥1,n为整数。
例如,手机响应于上述第一操作,手机可以在图8所示的第一缓存队列(Buffer)中缓存摄像头采集的第一图像。应注意,该第一缓存队列可以以先进先出的原则缓存摄像头采集的n帧第一图像。如图8所示,第一缓存队列的队尾可以执行入队操作,用于插入第一图像;第一缓存队列的队头可以执行出队操作,用于删除第一图像。在第一缓存队列中已缓存n帧第一图像的情况下,第一缓存队列的队尾每插入一帧第一图像,第一缓存队列的队头则删除一帧第一图像。
在一些实施例中,n可以等于1。在这种情况下,第一缓存队列中可以缓存一帧第一图像。如此,手机在执行S604-S605时,电子设备只能对一帧第一图像(即第二图像)进行第一图像处理,得到抓拍图像。
在另一些实施例中,n可以大于1。在这种情况下,第一缓存队列中可以缓存多帧第一图像。如此,手机在执行S604-S605时,电子设备可以对一帧第一图像进行图像处理得到抓拍图像,也可以对多帧第一图像进行图像处理得到抓拍图像。其中,电子 设备对多帧第一图像进行图像处理得到抓拍图像,可以对抓拍帧(即参考帧)起到画质增强的作用,有利于获取噪声和纹理等信息,可以进一步提升输出图像的画质。
本申请实施例中,n可以为预设数值。假设Sensor每秒钟可以曝光a帧Bayer图像,图3所示的延迟时长为b,则Sensor在延迟时长b内可以曝光出b/(1/a)=a*b帧Bayer图像。n可以大于或者等于a*b。
S604、手机响应于用户对抓拍快门的第二操作,根据第一图像的附加信息,从第一缓存队列缓存的n帧第一图像中选择出第二图像。其中,第一图像的附加信息包括该第一图像的对比度、摄像头采集该第一图像时的角速度和该第一图像的时间戳中的至少一个。
示例性的,上述第二操作可以是用户对抓拍快门的单击操作。例如,第二操作可以是用户对图7所示的抓拍快门的单击操作。或者,第二操作可以是用户对抓拍快门的连续点击操作。其中,对抓拍快门的每次单击操作,用于触发手机执行一次以下操作:“根据第一图像的附加信息,从第一缓存队列缓存的n帧第一图像中选择出第二图像”,以及S605。也就是说,对抓拍快门的单击操作用于触发手机抓拍一张照片。对抓拍快门的连续点击操作用于触发手机抓拍多张照片。其中,手机在录像过程中抓拍多张照片的方法与抓拍一张照片的方法类似,这里不予赘述。
本申请实施例中,如图5B所示,手机的HAL中的Camera HAL可以包括一个选帧模块。Camera HAL接收到来自Camera Service的抓拍指令后,选帧模块可以根据第一图像的附加信息,从第一缓存队列(Buffer)中缓存的n帧第一图像中选择出第二图像(即抓拍帧,也称为参考帧)。其中,第一图像的附加信息包括该第一图像的对比度、摄像头采集该第一图像时的角速度(简称为第一图像的角速度)以及该第一图像的时间戳中的至少一个。
其中,对比度也可以称为梯度。一帧图像的梯度越大,则该图像越清晰。梯度也可以称为锐度。一帧图像的锐度越大,则该图像越清晰。也就是说,对比度可以用于表征第一图像的清晰度。一帧图像(如第一图像)的对比度越高,则该图像越清晰。
角速度可以是陀螺仪传感器采集的。一帧图像(即第一图像)的角速度的值可以表征摄像头(如摄像头的Sensor)采集该图像时的角速度的大小。角速度越小,摄像头采集第一图像时抖动越小;角速度越大,摄像头采集第一图像时抖动越大。
如此,手机(如手机的HAL中的选帧模块)根据第一图像的附加信息,则可以从第一缓存队列(Buffer)中缓存的多帧Bayer图像(即第一图像)中选择出抖动小,且图像清晰度最大的Bayer图像作为抓拍帧(即第二图像)。例如,手机(如手机的HAL中的选帧模块)可以遍历Buffer中缓存的n帧第一图像,根据第一图像的附加信息,从Buffer中缓存的n帧第一图像中选择出抖动小,且图像清晰度最大的第一图像作为第二图像。
在一些实施例中,手机根据第一图像的附加信息,从第一缓存队列(Buffer)缓存的n帧第一图像中选择出第二图像的方法可以包括Sa。Sa:手机从第一缓存队列(Buffer)缓存的n帧第一图像中,选择对比度最大的第一图像。
在一种情况下,第一缓存队列(Buffer)缓存的n帧第一图像中,一帧第一图像的对比度最大,大于这一帧第一图像之外的其他第一图像的对比度。在这种情况下, 手机可以将对比度最大的这一帧第一图像作为第二图像。
在另一种情况下,第一缓存队列(Buffer)缓存的n帧第一图像中,可能包括至少两帧对比度相同的第一图像。并且,该至少两帧第一图像的对比度大于n帧第一图像中其他第一图像的对比度。在这种情况下,手机还可以执行Sb。Sb:手机从该至少两帧第一图像中,选择角速度最小的第一图像作为第二图像。
需要说明的是,Sensor曝光的Bayer图像(即第一图像)的元数据中是不包括上述附加信息的。第一图像的附加信息可以是由ISP的统计模块为第一缓存队列(Buffer)中的每一帧Bayer图像的元数据赋值得到的。
然后,手机中的选帧模块可以根据第一图像的附加信息,从第一缓存队列缓存的n帧第一图像中选择出第二图像。
在一些实施例中,上述每一帧第一图像中包括时间戳,该时间戳记录有图像传感器Sensor输出对应第一图像的时间(即曝光时间)。该时间戳也可以包括在第一图像的元数据中。
其中,手机中上层应用的时钟与Sensor记录第一图像出图的时钟同步;或者,手机中上层应用的时钟与Sensor记录第一图像出图的时钟为同一系统时钟。手机接收到第二操作(即抓拍操作)的时间可以由手机的应用层记录。如此,在手机中上层应用的系统时钟与Sensor的系统时钟同步的前提下,手机(如ISP的统计模块)则可以根据陀螺仪传感器采集每一个角速度的时间,选择采集时间与第一图像的时间戳所记录的曝光时间最近的一个角速度作为该第一图像的角速度。然后,ISP的统计模块可以将该第一图像的角速度写入该第一图像的元数据。
需要说明的是,在一些平台,可以将Sensor曝光结束时间作为时间戳;在另一些平台可以将Sensor开始曝光时间作为时间戳,本申请实施例对此不作限制。其中,上述曝光结束时间和开始曝光时间统称为曝光时间。
在另一些实施例中,如果第一缓存队列(Buffer)缓存的n帧第一图像的附加信息均相同,手机(如手机的HAL中的选帧模块)则可以根据第一图像的时间戳,从第一缓存队列(Buffer)缓存的n帧第一图像中,选择出时间戳所指示的时间与用户触发抓拍的时间最近的一帧第一图像作为抓拍帧(即第二图像)。
在另一些实施例中,手机(如手机的HAL中的选帧模块)在执行“从第一缓存队列(Buffer)缓存的n帧第一图像中选择出第二图像”之前,可以对第一图像进行异常判断,丢弃Buffer中的异常帧(即异常的第一图像),从Buffer中正常的第一图像中选择出第二图像。
其中,手机(如手机的HAL中的选帧模块)可以对比一帧第一图像(记为图像帧a)的曝光时间,以及与该图像帧a的前一帧第一图像(记为图像帧b)的曝光时间,判断图像帧a是否异常。应理解,Sensor曝光输出每一帧Bayer图像的曝光时间一般不会发生较大变化,如相邻图像帧的曝光时间不会突然变的很高,或者相邻图像帧的曝光时间不会突然变的很低。例如,相邻图像帧的曝光时间的差值一般不会超过10毫秒(ms),该差值最大不会超过20ms。因此,如果图像帧a的曝光时间与图像帧b的曝光时间的差值大于预设曝光阈值,则表示该图像帧a异常。该预设曝光阈值可以小于20ms,在10ms左右取值。例如,该预设曝光阈值可以为10ms、9ms、11ms或者 8ms等。
其中,手机(如手机的HAL中的选帧模块)从Buffer中正常的第一图像中选择出第二图像的方法,可以参考上述实施例中所述的方法,本申请实施例这里不予赘述。
S605、手机对包括第二图像的m帧第一图像进行第一图像处理,得到抓拍图像。该第一图像处理包括按照目标预览图像的裁剪方式和裁剪参数的参见处理,该目标预览图像是上述预览流中,手机接收到第二操作时摄像头采集的一帧图像,m≥1,m为整数。
在一些实施例中,m可以等于1。也就是说,m帧第一图像是上述第二图像。即手机对上述第二图像进行第一图像处理,便可以得到画质较高的抓拍图像。但是,一帧图像中的数据的完整性和纹理等参数均有限,手机只对一帧图像进行第一图像处理,可能并不能有效提升这一帧图像的画质。
基于此,在另一些实施例中,m可以大于1。具体的,手机可以对该第二图像以及m-1帧第一图像进行第一图像处理。即可以对n帧第一图像中、包括该第二图像在内的m帧第一图像进行第一图像处理。应理解,m帧第一图像中除第二图像之外的其他图像(即m-1帧第一图像),可以对抓拍帧(即参考帧)起到画质增强的作用,有利于获取噪声和纹理等信息,可以进一步提升抓拍图像的画质。
在一种情况下,上述m帧第一图像为n帧第一图像中包括该第二图像的连续的m帧图像。即该m帧第一图像为包括该第二图像在内的相邻的m帧第一图像。相邻图像的图像内容的相似度更高,因此采用连续的m帧图像得到抓拍图像,更有利于提升抓拍图像的图像质量。
在另一种情况下,该m帧第一图像包括该第二图像,以及n帧第一图像中、分辨率大于预设分辨率阈值的m-1帧第一图像。具体的,当某一帧第一图像的分辨率大于预设分辨率阈值时,说明该第一图像的分辨率较大,也可以理解为该第一图像的清晰度较高,此时手机可以确定该第一图像为上述m帧第一图像(具体为m-1帧第一图像)中包括的一帧图像。即上述m-1帧第一图像为n帧第一图像中分辨率较大的第一图像。因此,采用分辨率较大的第一图像得到抓拍图像,更有利于提升抓拍图像的图像质量。
在另一种情况下,该m帧第一图像包括该第二图像,以及n帧第一图像中、高动态范围(high dynamic range,HDR)参数满足预设HDR条件的m-1帧第一图像。具体的,当某一帧第一图像的HDR参数满足预设HDR条件时,说明该第一图像拥有很高的亮度动态范围以及更加丰富的色彩,也可以理解为该第一图像的画质较高,此时手机可以确定该第一图像为上述m帧第一图像(具体为m-1帧第一图像)中包括的一帧图像。即上述m-1帧第一图像为n帧第一图像中画质较高的第一图像。因此,采用画质较高的第一图像得到抓拍图像,更有利于提升抓拍图像的图像质量。
结合上述实施例的描述,应理解,当m≥2时,本申请实施例中所述的第一图像处理是一个多帧输入、单帧输出的图像处理过程。电子设备对包括第二图像的m帧第一图像进行第一图像处理,可以提升抓拍图像的画质,能够提升抓拍图像的图像质量。
可以理解的是,上述第一图像处理是电子设备基于软件的方式实现的。基于软件的方式实现对包括第二图像的m帧第一图像进行第一图像处理的过程,可以提升图像的处理效率,节省手机的功耗。
在一些实施例中,上述目标预览图像的裁剪方式包括中心裁剪方式,该目标预览图像的裁剪参数包括该目标预览图像的裁剪区域的中心点坐标和裁剪的尺寸信息。
应理解,一帧预览图像与一帧第一图像可能是全尺寸(full size)图像,手机可以按照对应的裁剪方式和裁剪参数对预览图像和第一图像进行裁剪处理。
在一种可选的技术方案中,上述裁剪的尺寸信息可以为被裁剪区域的尺寸信息,也可以为剩余区域的尺寸信息,该剩余区域为对第一图像进行裁剪处理之后得到的图像。可选地,该裁剪的尺寸信息包括宽高信息。
需要说明的是,本申请实施例中,手机可以通过时分复用的方式,采用ISP处理第一图像得到预览图像,采用ISP(如ISP的统计模块)为第一图像的元数据赋值,以及ISP对包括第二图像的m帧第一图像进行第一图像处理。也就是说,ISP(如ISP的统计模块)为第一图像的元数据赋值,ISP对包括第二图像的m帧第一图像进行第一图像处理,并不会影响手机采用ISP处理第一图像得到预览图像。换言之,手机处理图4C或图4D所示的抓拍流,并不会影响手机处理视频流(如预览流和录像流)。
可选地,本申请实施例中所述的预设RAW域图像处理算法是一个多帧输入、单帧输出的神经网络模型。其中,预设RAW域图像处理算法是一个RAW域的画质增强的深度学习网络。本方案中,采用预设RAW域图像处理算法可以提升抓拍图像的画质,有助于提升抓拍图像的图像质量。
示例性的,响应于用户对图7所示的抓拍快门的单击操作(即第二操作),手机可以生成并保存抓拍照片。但是,手机在录像过程中,用户并不能查看该抓拍照片。用户可以在录像结束后,在相册中查看该抓拍照片。例如,手机响应于用户对图9所示“结束录像”按钮706的点击操作,可以显示图9所示的录像的取景界面901。录像的取景界面901是手机未开始录像的取景界面。与图7所示的录像的取景界面701相比,手机的取景界面中的照片选项中的照片由图7所示的708更新为图9所示的902。手机可以响应于用户对相册应用的启动操作,显示图10所示的相册列表界面1001,该相册列表界面1001包括手机中保存的多张照片和视频。例如,如图7所示,相册列表界面1001包括手机录制的视频1003,以及手机在录制视频1003过程中抓拍的照片1002。
本申请实施例中,手机可以将Sensor曝光输出Bayer图像缓存在一个第一缓存队列(Buffer)中。该第一缓存队列可以缓存多帧Bayer图像。如此,即使从接收到用户的抓拍操作到Sensor接收到抓拍指令,存在图3所示的延迟时长;在这段延迟时长内Sensor出帧都可以缓存在Buffer中。因此,手机接收到用户的抓拍操作时,Sensor输出的Bayer图像也可以缓存在第一缓存队列中。并且,短时间内Sensor出帧的图像内容不会发生太大变化;因此,手机可以从Buffer中选择出图像质量较好的一帧图像作为抓拍图像。
并且,手机还可以对包括第二图像的m帧第一图像进行第一图像处理得到抓拍图像。由于该第一图像处理包括按照目标预览图像的裁剪方式和裁剪参数的裁剪处理;因此手机可以按照目标预览图像的裁剪方式和裁剪参数对包括第二图像的m帧第一图像进行裁剪处理,能够得到与目标预览图像的FOV相同的抓拍图像,可以提升抓拍图像的图像质量。
综上所述,采用本申请实施例的方法,可以在录像过程中抓拍到满足用户需求的图像,并且可以提升抓拍图像的图像质量。
在本申请实施例的一种实现方式中,上述m≥2。结合图6,如图11所示,手机对包括第二图像的m帧第一图像进行第一图像处理,得到抓拍图像(即S605),具体可以包括S1101-S1103。
S1101、手机对m帧第一图像进行图像融合,得到第三图像。
具体的,手机可以基于融合算法对多帧第一图像进行图像融合。应理解,通过对m帧第一图像进行图像融合可以对抓拍帧(即参考帧)起到画质增强的作用,有利于获取噪声和纹理等信息,可以进一步提升抓拍图像的画质。
S1102、手机按照目标预览图像的裁剪方式和裁剪参数,对第三图像进行裁剪处理,得到第四图像。
可以理解的是,手机m帧第一图像进行图像融合得到第三图像为基于多帧图像生成一帧图像的过程。如此,手机只需要对一帧图像(即第三图像)进行裁剪处理,能够提升图像的处理效率,节省手机的功耗。
S1103、手机对第四图像进行第二图像处理,得到抓拍图像。该第二图像处理包括:图片降噪、亮度及验收校正和图片美化处理中的至少一种。
应理解,图片降噪(或图片去噪)用于减少图像中的噪声,可以提升图像的清晰度,提升抓拍图像的图像质量。亮度及验收校正用于对图像的亮度及颜色进行校准,也可以提升抓拍图像的图像质量。手机可以基于美肤算法对图像进行图片美化处理,可以提升图像的美观性,提高抓拍图像的显示效果。
本实施例中,手机可以对m帧第一图像进行图像融合得到第三图像,并且按照目标预览图像的裁剪方式和裁剪参数对第三图像进行裁剪处理,得到第四图像。即电子设备通过将多帧第一图像融合成一帧图像(即第三图像),并且只需要对一帧图像进行裁剪处理,可以提升图像的处理效率,节省手机的功耗。
另外,手机还可以对第四图像进行图片降噪、亮度及验收校正以及图片美化处理等处理过程,能够提升抓拍图像的图像质量及显示效果。
示例性的,结合上述图4A中得到预览流的过程,说明本申请实施例中得到抓拍图像的具体过程。
例如,如图12所示,手机的图像传感器(Sensor)输出的每一帧Bayer图像经过手机的图像前端引擎、第一缓存队列、选帧模块、多帧融合模块、拜尔处理模块、图像处理引擎、风格化处理模块,便可以得到上述抓拍图像。
具体的,第一缓存队列用于缓存摄像头采集的第一图像。选帧模块用于从第一缓存队列缓存的n帧第一图像中选择出第二图像。多帧融合模块用于对m帧第一图像进行图像融合,得到第三图像。拜耳处理模块用于按照目标预览图像的裁剪方式和裁剪参数,对第三图像进行裁剪处理,得到第四图像;拜尔处理模块还用于对第四图像进行亮度及验收校正等处理。风格化处理模块用于对第四图像进行色彩处理、图片美化处理以及高动态处理等。
应理解,拜尔处理模块可以从防抖模块中获取目标预览图像的裁剪方式和裁剪参数。
结合图6,如图13所示,在上述S605之前,本申请实施例提供的方法还包括S1301-S1302。
S1301、手机获取目标预览图像的逻辑标识。该目标预览图像的逻辑标识用于标识采集该目标预览图像的摄像头。
结合上述实施例的描述,应理解,手机可以包括N个摄像头,N为大于1的正整数。手机可以通过多个摄像头采集第一图像。
S1302、手机根据目标预览图像的逻辑标识,从n帧第一图像中确定出包括第二图像的m帧第一图像。该m帧第一图像的逻辑标识与该目标预览图像的逻辑标识相同。
在一种情况下,一个逻辑标识对应一个摄像头。该m帧第一图像的逻辑标识与该目标预览图像的逻辑标识相同,说明采集该m帧第一图像的摄像头与采集该目标预览图像的摄像头为相同的摄像头。此时手机可以将n帧第一图像中,与采集目标预览图像的摄像头相同的摄像头采集的一帧第一图像,确定为m帧第一图像中的一帧图像。能够方便、快捷地从n帧第一图像中确定出m帧第一图像,进而可以提升抓拍图像的生成效率。
在另一种情况下,一个逻辑标识可以对应多个摄像头,也可以理解为一个逻辑标识对应一个摄像头集合,该摄像头集合中包括多个摄像头。该m帧第一图像的逻辑标识与该目标预览图像的逻辑标识相同,说明采集该m帧第一图像的摄像头与采集该目标预览图像的摄像头属于同一个摄像头集合。此时手机可以将该同一个摄像头集合中包括的多个摄像头中每一个摄像头采集的一帧第一图像,确定为m帧第一图像中的一帧图像。可以确定出大量的、待进行第一图像处理的第一图像,进而可以提升抓拍图像的图像质量。
在本申请实例的一种实现方式中,手机还可以采集该手机的当前温度。若手机的当前温度大于预设温度阈值,则上述m帧第一图像仅包括第二图像;若手机的当前温度小于或等于该预设温度阈值,则m≥2。
应理解,若手机的当前温度大于预设温度阈值时,说明手机的当前温度较高,如果继续高负荷运行可能会影响手机的性能。此时手机可以启用热逃生机制,具体为进行第一图像处理的时候只使用一帧图像(即第二图像),能够降低手机的运行负荷,节省手机的功耗。
若手机的当前温度小于或等于该预设温度阈值时,说明手机的当前温度不是很高。此时手机可以基于多帧第一图像得到抓拍图像,能够提升抓拍图像的图像质量。
本申请另一些实施例提供了一种电子设备,该电子设备可以包括:上述显示屏、摄像头、存储器和一个或多个处理器。该显示屏、摄像头、存储器和处理器耦合。该存储器用于存储计算机程序代码,该计算机程序代码包括计算机指令。当处理器执行计算机指令时,电子设备可执行上述方法实施例中手机执行的各个功能或者步骤。该电子设备的结构可以参考图5A所示的电子设备的结构。
本申请实施例还提供一种芯片系统,如图14所示,该芯片系统1400包括至少一个处理器1401和至少一个接口电路1402。处理器1401和接口电路1402可通过线路互联。例如,接口电路1402可用于从其它装置(例如电子设备的存储器)接收信号。又例如,接口电路1402可用于向其它装置(例如处理器1401)发送信号。示例性的, 接口电路1402可读取存储器中存储的指令,并将该指令发送给处理器1401。当所述指令被处理器1401执行时,可使得电子设备执行上述实施例中的各个步骤。当然,该芯片系统还可以包含其他分立器件,本申请实施例对此不作具体限定。
本申请实施例还提供一种计算机存储介质,该计算机存储介质包括计算机指令,当所述计算机指令在上述电子设备上运行时,使得该电子设备执行上述方法实施例中手机执行的各个功能或者步骤。
本申请实施例还提供一种计算机程序产品,当所述计算机程序产品在计算机上运行时,使得所述计算机执行上述方法实施例中手机执行的各个功能或者步骤。
通过以上实施方式的描述,所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,仅以上述各功能模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能模块完成,即将装置的内部结构划分成不同的功能模块,以完成以上描述的全部或者部分功能。
在本申请所提供的几个实施例中,应该理解到,所揭露的装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述模块或单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个装置,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是一个物理单元或多个物理单元,即可以位于一个地方,或者也可以分布到多个不同地方。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个可读取存储介质中。基于这样的理解,本申请实施例的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该软件产品存储在一个存储介质中,包括若干指令用以使得一个设备(可以是单片机,芯片等)或处理器(processor)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(read only memory,ROM)、随机存取存储器(random access memory,RAM)、磁碟或者光盘等各种可以存储程序代码的介质。
以上内容,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何在本申请揭露的技术范围内的变化或替换,都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以所述权利要求的保护范围为准。

Claims (11)

  1. 一种录像中抓拍图像的方法,其特征在于,应用于电子设备,所述方法包括:
    所述电子设备接收用户的第一操作;其中,所述第一操作用于触发所述电子设备开始录制视频;
    响应于所述第一操作,所述电子设备的摄像头采集第一图像,所述电子设备显示第一界面;其中,所述第一界面是所述电子设备正在录制视频的取景界面,所述取景界面显示预览流,所述预览流包括由所述第一图像得到的预览图像,所述第一界面还包括抓拍快门,所述抓拍快门用于触发所述电子设备抓拍图像得到照片;
    所述电子设备在第一缓存队列缓存所述摄像头采集的第一图像;其中,所述第一缓存队列缓存所述摄像头采集的n帧第一图像,n≥1,n为整数;
    所述电子设备响应于用户对所述抓拍快门的第二操作,根据第一图像的附加信息,从所述第一缓存队列缓存的n帧第一图像中选择出第二图像;其中,所述第一图像的附加信息包括所述第一图像的对比度、所述摄像头采集所述第一图像时的角速度和所述第一图像的时间戳中的至少一个;
    所述电子设备对包括所述第二图像的m帧第一图像进行第一图像处理,得到抓拍图像;其中,所述第一图像处理包括按照目标预览图像的裁剪方式和裁剪参数的裁剪处理,所述目标预览图像是所述预览流中、所述电子设备接收到所述第二操作时所述摄像头采集的一帧图像,m≥1,m为整数。
  2. 根据权利要求1所述的方法,其特征在于,所述目标预览图像的裁剪方式包括中心裁剪方式,所述目标预览图像的裁剪参数包括所述目标预览图像的裁剪区域的中心点坐标和裁剪的尺寸信息。
  3. 根据权利要求1或2所述的方法,其特征在于,m≥2,所述电子设备对包括所述第二图像的m帧第一图像进行第一图像处理,得到抓拍图像,包括:
    所述电子设备对所述m帧第一图像进行图像融合,得到第三图像;
    所述电子设备按照所述目标预览图像的裁剪方式和裁剪参数,对所述第三图像进行裁剪处理,得到第四图像;
    所述电子设备对所述第四图像进行第二图像处理,得到所述抓拍图像;其中,所述第二图像处理包括:图片降噪、亮度及验收校正和图片美化处理中的至少一种。
  4. 根据权利要求3所述的方法,其特征在于,所述方法还包括:
    所述电子设备获取所述目标预览图像的逻辑标识,所述目标预览图像的逻辑标识用于标识采集所述目标预览图像的摄像头;
    所述电子设备根据所述目标预览图像的逻辑标识,从所述n帧第一图像中确定出包括所述第二图像的所述m帧第一图像;
    其中,所述m帧第一图像的逻辑标识与所述目标预览图像的逻辑标识相同。
  5. 根据权利要求3所述的方法,其特征在于,所述m帧第一图像为所述n帧第一图像中包括所述第二图像的连续m帧图像;或者,
    所述m帧第一图像包括所述第二图像,以及所述n帧第一图像中、分辨率大于预 设分辨率阈值的m-1帧第一图像;或者,
    所述m帧第一图像包括所述第二图像,以及所述n帧第一图像中、高动态范围HDR参数满足预设HDR条件的m-1帧第一图像。
  6. 根据权利要求1-5中任一项所述的方法,其特征在于,所述第一图像的附加信息包括所述第一图像的对比度,所述第一图像的对比度用于表征所述第一图像的清晰度;
    所述第二图像为:所述第一缓存队列缓存的所述n帧第一图像中,对比度最高的第一图像。
  7. 根据权利要求1-6中任一项所述的方法,其特征在于,所述第一图像的附加信息包括所述摄像头采集所述第一图像时的角速度,所述角速度用于表征所述摄像头采集所述第一图像时的抖动情况;
    所述第二图像为:所述第一缓存队列缓存的所述n帧第一图像中,角速度最小的第一图像。
  8. 根据权利要求1-7中任一项所述的方法,其特征在于,所述第一图像的附加信息还包括所述第一图像的时间戳;每一帧第一图像中包括时间戳,所述时间戳记录有所述电子设备的图像传感器输出对应第一图像的时间;
    其中,所述电子设备中上层应用的时钟与所述图像传感器记录第一图像出图的时钟同步;或者,所述电子设备中上层应用的时钟与所述图像传感器记录第一图像出图的时钟为同一系统时钟;
    所述第二图像为:所述第一缓存队列缓存的所述n帧第一图像中,时间戳记录的时间与所述电子设备接收到所述第二操作的时间最近的第一图像。
  9. 根据权利要求3-5中任一项所述的方法,其特征在于,所述方法还包括:
    所述电子设备采集所述电子设备的当前温度;
    其中,若所述电子设备的当前温度大于预设温度阈值,则所述m帧第一图像仅包括所述第二图像,m=1;若所述电子设备的当前温度小于或等于所述预设温度阈值,则m≥2。
  10. 一种电子设备,其特征在于,包括:触摸屏、存储器、摄像头、显示屏、温度传感器和一个或多个处理器;所述触摸屏、所述存储器、所述摄像头、所述显示屏与所述处理器耦合;其中,所述存储器中存储有计算机程序代码,所述计算机程序代码包括计算机指令,当所述计算机指令被所述处理器执行时,使得所述电子设备执行如权利要求1-9任一项所述的方法。
  11. 一种计算机可读存储介质,其特征在于,包括计算机指令,当所述计算机指令在电子设备上运行时,使得所述电子设备执行如权利要求1-9中任一项所述的方法。
PCT/CN2023/113138 2022-09-14 2023-08-15 一种录像中抓拍图像的方法及电子设备 WO2024055797A1 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP23864531.1A EP4436198A1 (en) 2022-09-14 2023-08-15 Method for capturing images in video, and electronic device

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202211116174.5 2022-09-14
CN202211116174.5A CN116320783B (zh) 2022-09-14 2022-09-14 一种录像中抓拍图像的方法及电子设备

Publications (2)

Publication Number Publication Date
WO2024055797A1 true WO2024055797A1 (zh) 2024-03-21
WO2024055797A9 WO2024055797A9 (zh) 2024-05-02

Family

ID=86829269

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/113138 WO2024055797A1 (zh) 2022-09-14 2023-08-15 一种录像中抓拍图像的方法及电子设备

Country Status (3)

Country Link
EP (1) EP4436198A1 (zh)
CN (1) CN116320783B (zh)
WO (1) WO2024055797A1 (zh)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116320783B (zh) * 2022-09-14 2023-11-14 荣耀终端有限公司 一种录像中抓拍图像的方法及电子设备
CN117692791B (zh) * 2023-07-27 2024-10-18 荣耀终端有限公司 一种图像抓拍方法、终端、存储介质及程序产品
CN117689559B (zh) * 2023-08-07 2024-08-02 上海荣耀智慧科技开发有限公司 一种图像融合方法、装置、电子设备及存储介质

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130194481A1 (en) * 2012-01-29 2013-08-01 Michael Golub Snapshot spectral imaging based on digital cameras
CN103685933A (zh) * 2012-09-20 2014-03-26 宏达国际电子股份有限公司 视频及多张静态影像平行产生方法以及使用此方法的装置
CN105635614A (zh) * 2015-12-23 2016-06-01 小米科技有限责任公司 录像照相方法、装置及终端电子设备
CN110290323A (zh) * 2019-06-28 2019-09-27 Oppo广东移动通信有限公司 图像处理方法、装置、电子设备和计算机可读存储介质
CN111970440A (zh) * 2020-08-11 2020-11-20 Oppo(重庆)智能科技有限公司 图像获取方法、电子装置和存储介质
CN113810608A (zh) * 2021-09-14 2021-12-17 荣耀终端有限公司 一种拍摄方法、电子设备及存储介质
CN116320783A (zh) * 2022-09-14 2023-06-23 荣耀终端有限公司 一种录像中抓拍图像的方法及电子设备

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8520970B2 (en) * 2010-04-23 2013-08-27 Flir Systems Ab Infrared resolution and contrast enhancement with fusion
JP6370207B2 (ja) * 2014-12-17 2018-08-08 オリンパス株式会社 撮像装置、画像処理装置、撮像方法、およびプログラム
CN106791408A (zh) * 2016-12-27 2017-05-31 努比亚技术有限公司 一种拍摄预览装置、终端及方法
CN112887583B (zh) * 2019-11-30 2022-07-22 华为技术有限公司 一种拍摄方法及电子设备
CN113542613B (zh) * 2020-04-14 2023-05-12 华为技术有限公司 一种用于拍照的装置及方法
CN114845035B (zh) * 2021-01-30 2024-04-26 华为技术有限公司 一种分布式拍摄方法,电子设备及介质
CN112738414B (zh) * 2021-04-06 2021-06-29 荣耀终端有限公司 一种拍照方法、电子设备及存储介质
CN113810600B (zh) * 2021-08-12 2022-11-11 荣耀终端有限公司 终端的图像处理方法、装置和终端设备

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130194481A1 (en) * 2012-01-29 2013-08-01 Michael Golub Snapshot spectral imaging based on digital cameras
CN103685933A (zh) * 2012-09-20 2014-03-26 宏达国际电子股份有限公司 视频及多张静态影像平行产生方法以及使用此方法的装置
CN105635614A (zh) * 2015-12-23 2016-06-01 小米科技有限责任公司 录像照相方法、装置及终端电子设备
CN110290323A (zh) * 2019-06-28 2019-09-27 Oppo广东移动通信有限公司 图像处理方法、装置、电子设备和计算机可读存储介质
CN111970440A (zh) * 2020-08-11 2020-11-20 Oppo(重庆)智能科技有限公司 图像获取方法、电子装置和存储介质
CN113810608A (zh) * 2021-09-14 2021-12-17 荣耀终端有限公司 一种拍摄方法、电子设备及存储介质
CN116320783A (zh) * 2022-09-14 2023-06-23 荣耀终端有限公司 一种录像中抓拍图像的方法及电子设备

Also Published As

Publication number Publication date
EP4436198A1 (en) 2024-09-25
CN116320783B (zh) 2023-11-14
WO2024055797A9 (zh) 2024-05-02
CN116320783A (zh) 2023-06-23

Similar Documents

Publication Publication Date Title
WO2024055797A1 (zh) 一种录像中抓拍图像的方法及电子设备
CN113099146B (zh) 一种视频生成方法、装置及相关设备
WO2023160170A1 (zh) 拍摄方法和电子设备
CN115689963B (zh) 一种图像处理方法及电子设备
WO2023035921A1 (zh) 一种录像中抓拍图像的方法及电子设备
CN113536866A (zh) 一种人物追踪显示方法和电子设备
WO2024179101A9 (zh) 一种拍摄方法
WO2024179101A1 (zh) 一种拍摄方法
WO2021204103A1 (zh) 照片预览方法、电子设备和存储介质
WO2024179100A1 (zh) 一种拍摄方法
WO2023036007A1 (zh) 一种获取图像的方法及电子设备
WO2023231696A1 (zh) 一种拍摄方法及相关设备
WO2023160230A9 (zh) 一种拍摄方法及相关设备
CN115883958A (zh) 一种人像拍摄方法
CN115802147B (zh) 一种录像中抓拍图像的方法及电子设备
WO2023035920A1 (zh) 一种录像中抓拍图像的方法及电子设备
CN117692753B (zh) 一种拍照方法及电子设备
CN117389745B (zh) 一种数据处理方法、电子设备及存储介质
CN117956264B (zh) 拍摄方法、电子设备、存储介质和程序产品
WO2023035868A1 (zh) 拍摄方法及电子设备
WO2024179108A1 (zh) 一种拍照方法及电子设备
CN118870186A (zh) 拍摄方法及电子设备
CN117857915A (zh) 一种拍照方法、拍照装置及电子设备
CN118555468A (zh) 一种拍摄方法、图形界面及电子设备
CN118555470A (zh) 一种拍照方法及电子设备

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23864531

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2023864531

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 2023864531

Country of ref document: EP

Effective date: 20240621