DE102019113592A1 - Vehicle user assistant based on an analysis of an attention zone

Info

Publication number: DE102019113592A1
Application number: DE102019113592.3A
Authority: DE
Inventors: Antonyo Musabini; Hervé Marcasuzaa
Original assignee: Valeo Schalter und Sensoren GmbH
Current assignee: Valeo Schalter und Sensoren GmbH
Priority date: 2019-05-22
Filing date: 2019-05-22
Publication date: 2020-11-26

Abstract

The present invention relates to a method and a system for answering a user request relating to an object (O1, O2, O3) or an area outside the vehicle. In particular, the invention relates to a method and a system for extending in-vehicle voice assistants by providing them with additional context information that is learned from visual information whenever a voice-based request is made and that request is open or partially open. The method describes how this visual information is extracted using the driver's gaze vector, the scene captured by the vehicle's front camera, and image recognition solutions, and how this process is integrated into conventional voice assistant solutions. The present invention also relates to a computer program containing program code instructions for carrying out the steps of the above-mentioned method.

Description

The present invention relates to a method and a system for answering a user request relating to an object or an area outside the vehicle.

The present invention also relates to a computer program containing program code instructions for carrying out the steps of the above-mentioned method.

Voice assistants are normally based only on natural language analysis of voice requests in order to provide meaningful answers. Solutions have been proposed that provide voice assistants with context information in addition to the voice request, for a more natural user experience.

DE 10 2016 221 983 A1 discloses, for example, a method in which an object or an area outside a vehicle is determined on which the attention of at least one user is focused for a period of time. It is then checked whether information related to the detected object or area is stored in a memory of the vehicle, in order to obtain a check result. If the check result is positive, the stored information is output. In addition, DE 10 2016 221 983 A1 discloses, for example in paragraph [0065], a car 400 with a storage device 440 that stores information about objects 110 located outside the car. The car has a device 410 that can detect which object the driver of the car is looking at.

However, the above method has the disadvantage that it is sometimes difficult to identify which object or area the user is actually asking about, so that the system has only limited means of identifying and answering the request.

The object of the present invention is to provide a better way of implementing similar tasks.

This object is achieved by the independent claims. Advantageous embodiments are specified in the dependent claims.

In particular, the present invention specifies a method for answering a user request relating to an object or an area outside the vehicle, comprising the following steps:

detecting, by an assistant module, a user request regarding an object or an area at a point in time,
checking whether more context information about the requested object or area is needed than is already available, in order to obtain a check result, and
if the check result is "false", preferably processing the user request and outputting information,
if the check result is "true", checking whether the user request is an open or a partially open request, an open request containing no reference keyword and a partially open request containing at least one reference keyword,
in the case of an open request, detecting the user's gaze direction and retrieving, by a gaze-controlled interaction module, information about an object at the intersection of the gaze vector with at least one field of view of a front camera of the vehicle with respect to the point in time, and cropping, by the gaze-controlled interaction module, an image of the field of view of the vehicle's front camera,
requesting, by an image recognition module, a general image recognition process for the image after the image of the field of view of the vehicle's front camera has been cropped,
in the case of a partially open request, detecting the user's gaze direction and retrieving, by a gaze-controlled interaction module, information about an object at the intersection of the gaze vector with at least one field of view of a front camera of the vehicle with respect to the point in time, and cropping, by the gaze-controlled interaction module, an image of the field of view of the vehicle's front camera,
requesting, by an image recognition module, a general image recognition process for the image after the image of the field of view of the vehicle's front camera has been cropped,
wherein information of the at least one reference keyword is extracted in order to request the general image recognition process for the image by the image recognition module.
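
The distinction between open and partially open requests can be sketched as follows. This is only an illustration under stated assumptions, not the claimed implementation: the keyword set `REFERENCE_KEYWORDS`, the function `classify_request`, and the simple word matching are hypothetical, since the source does not specify how reference keywords are detected.

```python
import re
from enum import Enum

# Hypothetical vocabulary of reference keywords; the source does not list any.
REFERENCE_KEYWORDS = {"car", "building", "restaurant", "bridge", "shop"}


class RequestType(Enum):
    OPEN = "open"                      # contains no reference keyword
    PARTIALLY_OPEN = "partially open"  # contains at least one reference keyword


def classify_request(text):
    """Return the request type and the reference keywords found in the text."""
    words = re.findall(r"[a-z]+", text.lower())
    keywords = [w for w in words if w in REFERENCE_KEYWORDS]
    kind = RequestType.PARTIALLY_OPEN if keywords else RequestType.OPEN
    return kind, keywords
```

Under these assumptions, "What is that?" would be treated as an open request, while "Which model is that car?" contains the keyword "car" and would be treated as partially open.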

At least one field of view of the front camera of the vehicle means that at least one camera, out of possibly many cameras, is able to capture the view to the front of the vehicle.

The present invention also specifies a system for answering a user request relating to an object or an area outside the vehicle.

The basic idea of the invention is that the user request triggers an analysis process which, in particular, determines the user's gaze direction and, using a suitably equipped system and vehicle, tracks which object or area is in the user's field of vision. Based on this identification of the field of view, an image is captured and automatically matched against a database system. Depending on the complexity of the requests and the completeness of the database, the user can receive answers of varying detail. Advantageously, the user can keep his hands on the steering wheel and at the same time obtain a more precise result about the object or area actually being viewed. Thanks to gaze and image recognition, this invention proposes a better way of implementing similar tasks and more, without relying on gestures, GPS coordinates, or pre-stored points of interest.

According to a modified embodiment of the invention, the method includes, if an image-related suggestion is received from any recognition process, merging all image-related suggestions,
wherein, after the image-related suggestions have been merged into new context information, the assistant module adds the new context information to the existing information for the user request, whereupon the user request is processed and information is output,
and processing the user request and outputting information if no image-related suggestion is received after the general image recognition process has been requested.
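
The merging step can be illustrated with a short sketch. The data shapes are assumptions made for the example: a suggestion is taken to be a `(label, confidence)` pair, the context is a plain dictionary, and keeping the highest confidence per label is a hypothetical merge rule that the source does not prescribe.

```python
def merge_suggestions(suggestions):
    """Merge image-related suggestions into one mapping of label to confidence.

    Assumed rule: if a label is suggested more than once, keep the highest
    confidence reported for it.
    """
    merged = {}
    for label, confidence in suggestions:
        merged[label] = max(confidence, merged.get(label, 0.0))
    return merged


def extend_context(existing, suggestions):
    """Return a copy of the existing context, extended by merged suggestions."""
    context = dict(existing)
    if suggestions:
        # New context information is added to the existing information.
        context["visual"] = merge_suggestions(suggestions)
    # With no suggestions, the request is processed with the existing context.
    return context
```

If no suggestion is received, `extend_context` returns the context unchanged, which corresponds to processing the user request with the existing information only.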

According to a modified embodiment of the invention, cropping an image of the field of view of the vehicle's front camera in the case of an open request comprises two options, namely:

if retrieving information about an object at the intersection of the gaze vector with the field of view of the vehicle's front camera with respect to the point in time is not successful, an image section of a standard size is cropped,
if retrieving information about an object at the intersection of the gaze vector with the field of view of the vehicle's front camera with respect to the point in time was successful, an image section is cropped whose size is calculated from the apparent distance and the apparent size of the object at the intersection, in particular as determined by sensors such as LiDAR or radar,
wherein the general image recognition process for the image is requested by the image recognition module after cropping an image section of the standard size or after cropping an image section whose size is calculated from the apparent distance and the apparent size of the object at the intersection.

In the case of an open request, this measure enables better object detection where possible. Cropping an image section of the standard size enables a comprehensive image analysis by a suitably capable downstream image analysis system. This is particularly advantageous if no object or area can be identified more precisely when checking the field of view. In contrast, cropping an image section whose size is calculated from the apparent distance and the apparent size of the cropped object enables faster image evaluation, since a specific object or a specific area is analyzed. No subsequent object or area detection is required, so that the image recognition module can immediately start a database comparison for a more precise determination.
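
The two cropping options can be sketched as follows. The source only states that the crop size is calculated from the apparent distance and apparent size of the object; the pinhole projection used here (pixel size = focal length in pixels × object size / distance), the default size `DEFAULT_CROP_PX`, the focal length, and the margin factor are all assumptions chosen for illustration.

```python
DEFAULT_CROP_PX = 256  # hypothetical standard crop size in pixels


def crop_size_px(distance_m=None, object_size_m=None,
                 focal_length_px=1000.0, margin=1.2):
    """Edge length in pixels of a square crop around the gaze point.

    Falls back to the standard size when no object information is available.
    Otherwise uses an assumed pinhole projection plus a safety margin.
    """
    if distance_m is None or object_size_m is None or distance_m <= 0:
        return DEFAULT_CROP_PX
    projected = focal_length_px * object_size_m / distance_m
    return max(1, round(projected * margin))


def crop_box(gaze_x, gaze_y, size, image_w, image_h):
    """Return (left, top, right, bottom) around the gaze point, clamped to the image."""
    half = size // 2
    left = max(0, gaze_x - half)
    top = max(0, gaze_y - half)
    right = min(image_w, gaze_x + half)
    bottom = min(image_h, gaze_y + half)
    return left, top, right, bottom
```

The gaze point `(gaze_x, gaze_y)` stands for the intersection of the gaze vector with the camera image. Near the image border the box is clamped, so the crop may be smaller than the requested size.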

According to a modified embodiment of the invention, cropping an image of the field of view of the vehicle's front camera in the case of a partially open request comprises two options, namely:

cropping an image section of a standard size if retrieving information about an object at the intersection of the gaze vector with the field of view of the vehicle's front camera with respect to the point in time is not successful,
cropping an image section whose size is calculated from the apparent distance and the apparent size of the cropped object if retrieving information about an object at the intersection of the gaze vector with the field of view of the vehicle's front camera with respect to the point in time was successful,
wherein the general image recognition process for the image is requested by the image recognition module after cropping an image section of the standard size or after cropping an image section whose size is calculated from the apparent distance and the apparent size of the object at the intersection.

In the case of a partially open request, this measure enables better object detection where possible. Cropping an image section of the standard size enables a comprehensive image analysis by a suitably capable downstream image analysis system. This is particularly advantageous if no object or area can be identified more precisely when checking the field of view. In contrast, cropping an image section whose size is calculated from the apparent distance and the apparent size of the object at the intersection enables faster image evaluation, since a specific object or a specific area is analyzed. No subsequent object or area detection is required, so that the image recognition module can immediately start a database comparison for a more precise determination.

According to a modified embodiment of the invention, in the case of a partially open request,

after an image of the field of view of the vehicle's front camera has been cropped, a general image recognition process for the image is requested and, if possible, a specialized object recognition process for the at least one reference keyword is additionally requested by an image recognition module,
wherein information of the at least one reference keyword is extracted in order to request the general image recognition process for the image and, if possible, also to request a specialized object recognition process for the at least one reference keyword by the image recognition module.

According to this exemplary measure, the step of requesting a general image recognition process for the image is therefore always carried out first. Requesting a specialized object recognition process for the at least one reference keyword is preferred only if, for example, a sufficiently suitable keyword is available. In this case, the database analysis of the existing image is carried out using the reference keyword. The reference keyword can narrow the database search space, so that the image-related suggestion can be output in a short time and with a high matching accuracy.

According to a modified embodiment of the invention, requesting the general image recognition process for the image by the image recognition module can yield at least one reference keyword, so that requesting a specialized object recognition process for the at least one reference keyword is the next step.

Since a reference keyword can be found by requesting the general image recognition process for the image by the image recognition module, it is possible to turn an open request into a partially open request afterwards, without the user needing to know exactly what the object is. For example, if a user asks what kind of object the viewed object is, requesting the general image recognition process for the image can identify a car. In a next step, the system can use this information internally to request a specialized object recognition process for the at least one reference keyword, in order to find out which model the car is.
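
The chaining of the two recognition stages can be sketched as follows. The callables `general` and `specialized` are hypothetical stand-ins for the recognition services, which the source does not specify; the sketch only shows how a label returned by the general stage can serve as a reference keyword for the specialized stage.

```python
def recognize(image, keywords, general, specialized):
    """Run general recognition first, then specialized recognition per keyword.

    general(image) returns a label or None (hypothetical callable).
    specialized(image, keyword) returns a detail or None (hypothetical callable).
    """
    label = general(image)
    if label is not None and label not in keywords:
        # The general result acts as a reference keyword found by the system,
        # so an open request is handled like a partially open one.
        keywords = keywords + [label]
    details = {}
    for keyword in keywords:
        detail = specialized(image, keyword)
        if detail is not None:
            details[keyword] = detail
    return label, details
```

With an empty keyword list (an open request), a general result such as "car" is passed on to the specialized stage, which could then return the model of the car.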

According to a modified embodiment of the invention, requesting the general image recognition process for the image and/or requesting a specialized object recognition process for the at least one reference keyword is carried out externally, the image recognition module being a remote server or a cloud server, or at least using such a server.

A data feed from an external source ensures that the most up-to-date data is always available. For example, the opening hours of a point of interest may change, or a car may be a new model. This data can preferably be processed on a remote server or a cloud server. In particular, a remote server provides information, which can be evaluated in the vehicle. This can be advantageous in areas with poor reception, in order to keep the data volume as low as possible and thus ensure a short response time for the system. A cloud server, by contrast, is configured such that the actual data analysis also takes place on the cloud server.
The use of a remote server or a cloud server reduces the computing capacity required in the car. It should be borne in mind that a car normally has a considerably longer service life than a computing device. Offloading the computation to a cloud server avoids the situation in which a car (which, as noted above, may be in use for several years) or its computing unit is too old or too slow to process the required data stream.

According to a modified embodiment of the invention, requesting the general image recognition process for the image is carried out primarily in the vehicle and externally only if there is a further need, whereas requesting a specialized object recognition process for the at least one reference keyword is always carried out externally.

This can represent a sensible distribution of resources, in that the step of requesting the general image recognition process for the image can be used for an initial query of simple terms or objects. For example, a car can be identified relatively easily on the basis of certain patterns, whereas the requested model of a car may have been manufactured after the year of construction of the ego vehicle, so that the step of requesting a specialized object recognition process for the at least one reference keyword should be handled by an external database that is fully up to date.
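
A minimal sketch of this division of work is given below. The source does not define when a further need for external processing exists; treating a missing or low-confidence on-board result as that trigger is an assumption, as are the callables `onboard` and `external` and the threshold `min_confidence`.

```python
def general_recognition(image, onboard, external, min_confidence=0.5):
    """Run general recognition in the vehicle first, externally only if needed.

    onboard(image) and external(image) are hypothetical callables that return
    a (label, confidence) pair or None.
    """
    result = onboard(image)
    if result is not None and result[1] >= min_confidence:
        return result, "onboard"
    # Assumed trigger for the external request: no or weak on-board result.
    fallback = external(image)
    if fallback is not None:
        return fallback, "external"
    return result, "onboard"
```

The specialized object recognition process is not covered by this fallback logic, since in this embodiment it is always requested externally.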

According to a modified embodiment of the invention, the user request is a voice-based request from the user.
This can increase driving safety because, for example, the driver can keep his hands on the steering wheel at all times. It also makes operation of the entire system intuitive.

According to a modified embodiment of the invention, the user request is a gesture-based user request.
This can be advantageous if, for example, a hands-free call is in progress in the vehicle. For example, the user may want to receive information about the surroundings for orientation purposes. A voice-based request (i.e. a request spoken by the user) would require the telephone call to be interrupted, because a request made during the call could confuse the person at the other end of the line. The proposed measure avoids this.

The invention also relates to a system with a device for applying one of the preceding methods.
Such a system gives the user a safe driving experience, since precise information can be made available to him at any time when needed.

According to a modified embodiment of the invention, the gaze-controlled interaction module has an eye tracker and/or a gesture tracker. This enables an attention zone of the user to be analyzed on the basis of his gaze and/or his gestures. The user therefore does not have to expend much effort to make his requests and can concentrate better on the road traffic.

According to a modified embodiment of the invention, the assistant module is a voice assistant module. This can increase driving safety because, for example, the driver can keep his hands on the steering wheel at all times. It also makes operation of the entire system intuitive.

According to a modified embodiment of the invention, the assistant module is a gesture assistant module. This can be advantageous if, for example, a hands-free call is in progress in the vehicle. For example, the user may want to receive information about the surroundings for orientation purposes. Otherwise the telephone call would have to be interrupted, which the proposed measure makes unnecessary.

According to a modified embodiment of the invention, an internal memory unit of a vehicle is used to retrieve information offline, this information including in particular places of interest. This can be advantageous in areas with poor reception in order to keep the data volume as low as possible and thus ensure a short response time for the system. Optionally, the internal memory can be updated so that current information is still available after a longer period of time.

According to a modified embodiment of the invention, the image recognition module is an external server, in particular a remote server or a cloud server, preferably using a database, preferably based on neural network technology. Supplying data from an external source ensures that the most recent data can always be retrieved. For example, the opening hours of a place of interest may change, or a car may be a new model. These data can preferably be processed on a remote server or a cloud server. In particular, a remote server provides information. This information can be evaluated in the vehicle. This can be advantageous in areas with poor reception in order to keep the data volume as low as possible and thus ensure a short response time for the system. A cloud server, on the other hand, is designed such that the actual data analysis also takes place on the cloud server. This reduces the computing capacity required in the car. It should be borne in mind that a car usually has a considerably longer service life than a computing device. Offloading the computation to a cloud server prevents a car that has been in service for several years, or its processing unit, from being too old or too slow for the required data stream. Neural network technology has the advantage that it can adapt. For example, car contours may change with fashion. With a recurrent analysis, the neural network technology can identify these contours as vehicle contours more quickly.

According to a modified embodiment of the invention, the field of view of the front camera of the vehicle is supplemented by a sensor system using radar, LIDAR, ultrasound, stereo vision and/or infrared technology. This increases the measurement accuracy. These sensors have proven useful for measuring distances to objects and for cropping the correct region.

The invention further relates to a computer program with program code instructions for executing the steps of the method according to one of the above measures.

These and other aspects of the invention will become apparent from, and are explained with reference to, the embodiments described below. Individual features shown in the embodiments can form an aspect of the present invention on their own or in combination. Features of the various embodiments can be transferred from one embodiment to another.

The figures show:

FIG. 1 a flow chart according to a preferred embodiment of the invention; and
FIG. 2 by way of example, how the gaze direction of a user with respect to objects is digitized in order to crop an image of the field of view of the front camera of the vehicle, according to the embodiment of FIG. 1.

FIG. 1 shows a flow chart according to a preferred embodiment of the invention.

In particular, FIG. 1 shows a preferred method for answering a user request that relates to an object O1, O2, O3 or an area outside the vehicle, with the following steps: First, a user request relating to an object O1, O2, O3 or an area is detected 100 by an assistant module 1 at a time t. Here, the time t is the time at which the user request is detected.

As a next step, it is checked 200, in accordance with the requested object or area, whether more context information is required than the existing information provides, in order to obtain a check result.

If the check result is false F, the user request is processed 300 and information is output 400. This is done in accordance with the prior art.

If the check result is true T, it is checked whether the user request is an open request or a partially open request. An open request has no reference keyword. A partially open request contains at least one reference keyword.
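
The distinction between open and partially open requests can be illustrated with a minimal Python sketch. The keyword vocabulary `REFERENCE_KEYWORDS` and the function names are hypothetical and only serve the illustration; the source does not specify how reference keywords are recognized in the request.

```python
from enum import Enum
from typing import List


class RequestKind(Enum):
    OPEN = "open"                      # no reference keyword in the request
    PARTIALLY_OPEN = "partially_open"  # at least one reference keyword


# Hypothetical vocabulary of reference keywords (assumption, not from the source).
REFERENCE_KEYWORDS = {"car", "truck", "building", "restaurant", "bridge"}


def extract_reference_keywords(utterance: str) -> List[str]:
    """Return the reference keywords found in the request, in spoken order."""
    words = [w.strip(".,?!").lower() for w in utterance.split()]
    return [w for w in words if w in REFERENCE_KEYWORDS]


def classify_request(utterance: str) -> RequestKind:
    """A request is partially open as soon as it contains one reference keyword."""
    if extract_reference_keywords(utterance):
        return RequestKind.PARTIALLY_OPEN
    return RequestKind.OPEN
```

Under these assumptions, "What is that?" is classified as an open request, while "What model is that car?" is partially open with the reference keyword "car".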

In the case of an open request, a gaze-controlled interaction module 2 detects the gaze direction G of the user and retrieves 500 information about an object O1 at the intersection of the gaze vector with at least one field of view of a front camera of a vehicle with respect to the time t. In general, an image of the field of view of the front camera of the vehicle is cropped 600 after this step.

The gaze direction G of the user can be detected with an eye-tracker system, which can be part of the gaze-controlled interaction module 2. If retrieving 500 information about the object O1 at the intersection of the gaze vector with at least the field of view of the front camera of the vehicle at the time t is not successful, meaning that the object detection result is false F, an image section of a default size is cropped 600a. This ensures that a wide image is captured that should contain, somewhere in it, the object O1 seen and asked about by the user. The object O1 is thus available for a further analysis step despite not having been identified. In contrast, if retrieving 500 information about the object O1 at the intersection of the gaze vector with at least the field of view of the front camera of the vehicle with respect to the time t is successful, meaning that the object detection result is true T, an image section is cropped 600b whose size is calculated from the apparent distance and the apparent size of the object O1. This ensures that an image of the object O1 viewed by the user is captured as accurately and comprehensively as possible. Because unnecessary image information is omitted, further analysis steps can be carried out with fewer resources, i.e. faster, with less energy and, in particular, more accurately.
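
The two cropping variants 600a and 600b can be sketched as follows. This is an illustration under stated assumptions, not the method of the source: the source does not give a formula, so the sketch assumes a pinhole camera in which an object of physical extent `extent_m` at distance `distance_m` appears with roughly `focal_px * extent_m / distance_m` pixels. The names `DEFAULT_CROP_PX`, `focal_px` and `margin` and all numeric values are hypothetical.

```python
from typing import Optional, Tuple

DEFAULT_CROP_PX = 640  # hypothetical default edge length for step 600a


def crop_edge_px(distance_m: Optional[float], extent_m: Optional[float],
                 focal_px: float = 1000.0, margin: float = 1.5) -> int:
    """Edge length of a square crop in pixels.

    Falls back to the default size (600a) when no object was detected,
    otherwise scales with the apparent size of the object (600b).
    """
    if distance_m is None or extent_m is None or distance_m <= 0:
        return DEFAULT_CROP_PX
    apparent_px = focal_px * extent_m / distance_m  # pinhole approximation
    return max(1, round(margin * apparent_px))


def crop_box(center_xy: Tuple[int, int], edge_px: int,
             image_wh: Tuple[int, int]) -> Tuple[int, int, int, int]:
    """Square box (left, top, right, bottom) around the gaze point, kept inside the image."""
    cx, cy = center_xy
    width, height = image_wh
    edge = min(edge_px, width, height)
    half = edge // 2
    left = min(max(cx - half, 0), width - edge)
    top = min(max(cy - half, 0), height - edge)
    return left, top, left + edge, top + edge
```

For a 4 m long object at 20 m, these assumed values give a 300 pixel crop around the gaze point. Without a detection, the wider default crop is used so that the object is still likely to be contained in the image section.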

After an image section of the default size has been cropped 600a, or after an image section whose size is calculated from the apparent distance and the apparent size of the object O1 at the intersection has been cropped 600b, a general image recognition process for the image by an image recognition module 3 is requested 900 in the next step.

In the case of a partially open request, a gaze-controlled interaction module 2 detects the gaze direction G of the user and retrieves 500 information about an object O1 at the intersection of the gaze vector with at least the field of view of a front camera of a vehicle with respect to the time t. In general, an image of the field of view of the front camera of the vehicle is cropped 600 after this step.

The gaze direction G of the user can be detected by an eye-tracker system, which can be part of the gaze-controlled interaction module 2. If retrieving 500 information about the object O1 at the intersection of the gaze vector with at least the field of view of the front camera of the vehicle with respect to the time t is not successful, meaning that the object detection result is false F, an image section of the default size is cropped 600a. This ensures that a wide image is captured that should contain, somewhere in it, the object O1 seen and asked about by the user. The object O1 is thus available for a further analysis step despite not having been identified. In contrast, if retrieving 500 information about the object O1 at the intersection of the gaze vector with at least the field of view of the front camera of the vehicle with respect to the time t is successful, meaning that the object detection result is true T, an image section is cropped 600b whose size is calculated from the apparent distance and the apparent size of the object O1 at the intersection. This ensures that an image of the object O1 viewed by the user is captured as accurately and comprehensively as possible. Because unnecessary image information is omitted, further analysis steps can be carried out with fewer resources, i.e. faster, with less energy and, in particular, more accurately.

FIG. 2 illustrates a situation in which the gaze-controlled interaction module 2 detects the gaze direction G of the user and retrieves 500 information about an object O1 at the intersection of the gaze vector with at least the field of view of a front camera of a vehicle with respect to the time t. Accordingly, there is a traffic situation in which at least three objects O1, O2, O3 are located outside the ego vehicle. When the user has made a request and the next step is the above-mentioned step, it is recognized that the gaze direction G of the user leads to the object O1. The other objects O2 and O3 are irrelevant. Dashboard objects DO are likewise irrelevant and are not part of the regular detection range for the gaze direction G of the user. As a next step, an image section is cropped 600b whose size is calculated from the apparent distance and the apparent size of the object O1 at the intersection, in order to obtain a detailed image of the object O1. Detecting the object O1 can be easier if it is a moving object and the background is stationary. Detection could be more difficult if the object O1 is in a traffic jam and motionless relative to the background, or if it is foggy or rainy.
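
How a gaze direction might be mapped onto the front camera image and matched to one of the objects can be sketched as follows. The sketch rests on several assumptions that are not taken from the source: the gaze direction is already expressed in the camera coordinate frame (x to the right, y downwards, z forwards), the camera follows a pinhole model with the principal point at the image center, the parallax between the eye position and the camera is neglected, and the bounding boxes come from some hypothetical object detector.

```python
from typing import Dict, Optional, Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom) in pixels


def gaze_to_pixel(gaze_dir: Tuple[float, float, float], focal_px: float,
                  image_wh: Tuple[int, int]) -> Optional[Tuple[int, int]]:
    """Project a gaze direction onto the image; None if it misses the field of view."""
    x, y, z = gaze_dir
    if z <= 0:
        return None  # user is not looking forwards
    width, height = image_wh
    u = width / 2 + focal_px * x / z
    v = height / 2 + focal_px * y / z
    if not (0 <= u < width and 0 <= v < height):
        return None  # e.g. a glance down at a dashboard object
    return round(u), round(v)


def object_at(pixel: Tuple[int, int], boxes: Dict[str, Box]) -> Optional[str]:
    """Name of the first detected object whose bounding box contains the pixel."""
    px, py = pixel
    for name, (left, top, right, bottom) in boxes.items():
        if left <= px < right and top <= py < bottom:
            return name
    return None
```

With three hypothetical boxes for O1, O2 and O3, a gaze slightly to the right of the image center selects O1 only, and a gaze far below the image returns `None`, which corresponds to dashboard objects lying outside the regular detection range.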

Furthermore, according to FIG. 1, after an image section of the default size has been cropped 600a, or after an image section whose size is calculated from the apparent distance and the apparent size of the object O1 at the intersection has been cropped 600b, the general image recognition process for the image is requested 900 and, if possible, a specialized object recognition process for the at least one reference keyword by an image recognition module 3 is also requested 800. This last step goes at least one level deeper and attempts to obtain precise information about the object. For example, the request may concern not only the object class, such as car, but also the model of the car being viewed, for example the manufacturer or color of the car. However, such a request can only be answered if a reference keyword is available in some form, in particular one relating to the superordinate category.

For this reason, in this case information on the at least one reference keyword is extracted 700 in order to request 900 the general image recognition process for the image and, in particular, to request 800 a specialized object recognition process for the at least one reference keyword by the image recognition module 3.

If an image-related suggestion P is obtained after requesting 800 the specialized object recognition process for the at least one reference keyword or after requesting 900 the general image recognition process for the image, all image-related suggestions are merged 1000. In this step, all new context information is combined into groups.

After all image-related suggestions have been merged 1000 into new context information, the assistant module 1 adds 1100 the new context information to the existing information in accordance with the user request. In this step, all information to be output is grouped together. Finally, the user request is processed 300 and the information is output 400.
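
Merging the suggestions 1000 and adding them to the existing information 1100 could look like the following sketch. The representation of a suggestion as a `(label, score)` pair, the rule of keeping the highest score per label and the dictionary key `visual_context` are assumptions made for the illustration only.

```python
from typing import Dict, List, Tuple

Suggestion = Tuple[str, float]  # (label, confidence score), hypothetical format


def merge_suggestions(*suggestion_lists: List[Suggestion]) -> List[Suggestion]:
    """Step 1000: group suggestions by label, keep the best score, sort by score."""
    merged: Dict[str, float] = {}
    for suggestions in suggestion_lists:
        for label, score in suggestions:
            merged[label] = max(score, merged.get(label, 0.0))
    return sorted(merged.items(), key=lambda item: item[1], reverse=True)


def add_context(existing: dict, merged: List[Suggestion]) -> dict:
    """Step 1100: attach the new context information to the existing request information."""
    enriched = dict(existing)  # leave the original request data untouched
    enriched["visual_context"] = [label for label, _ in merged]
    return enriched
```

Here the results of the general and the specialized recognition are passed in as two lists. A label reported by both keeps its higher score, and the assistant module receives the labels in descending order of confidence.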

If no image-related suggestion NP is obtained after the general image recognition process has been requested 900, the user request is processed 300 and the information is output 400. In this case, the main assumption may be that the information consists in the fact that no information can be output.
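
The branching of FIG. 1 described so far can be summarized in a small sketch that returns the sequence of reference numerals for a given set of outcomes. The function name and its boolean parameters are hypothetical, and the position of step 700 directly after cropping is an assumption, since the text does not fix its order relative to steps 500 and 600.

```python
from typing import List


def flow_steps(needs_context: bool, has_keyword: bool,
               object_found: bool, got_suggestion: bool) -> List[str]:
    """Sequence of steps of FIG. 1 for one user request."""
    steps = ["100", "200"]                     # detect request, check context need
    if needs_context:                          # check result true T
        steps.append("500")                    # gaze direction and object lookup
        steps.append("600b" if object_found else "600a")
        if has_keyword:                        # partially open request
            steps += ["700", "900", "800"]
        else:                                  # open request
            steps.append("900")
        if got_suggestion:                     # suggestion P obtained
            steps += ["1000", "1100"]
    steps += ["300", "400"]                    # process request, output information
    return steps
```

For a check result of false F the sketch yields only steps 100, 200, 300 and 400, which corresponds to the prior-art path.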

According to a modified embodiment of the invention, requesting the general image recognition process for the image by the image recognition module 3 can yield at least one reference keyword, so that requesting 800 a specialized object recognition process for the at least one reference keyword is the next step. Since a reference keyword can be found by requesting 900 the general image recognition process for the image by the image recognition module 3, it is possible to change an open request into a partially open request afterwards, without the user having precise knowledge. For example, if a user asks what kind of object the viewed object O1 is, requesting 900 the general image recognition process for the image from an external database can identify a car on a tow truck. In a next step, the system can internally use the information “car on a tow truck” as the at least one reference keyword when requesting 800 a specialized object recognition process, in order to find out exactly which model this car or this tow truck is.
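
Turning an open request into a partially open one can be sketched as follows. The two recognizers are passed in as plain callables, and the stubs at the end are placeholders for illustration. They do not represent a real recognition service, and their return values are invented for the example.

```python
from typing import Callable, List, Tuple

Suggestion = Tuple[str, float]  # (label, confidence score), hypothetical format


def answer_visual_request(
    image: object,
    keywords: List[str],
    general: Callable[[object], List[Suggestion]],
    specialized: Callable[[object, str], List[Suggestion]],
) -> Tuple[List[str], List[Suggestion]]:
    """Run step 900, derive keywords for an open request, then run step 800."""
    suggestions = list(general(image))                    # step 900
    if not keywords:
        # Open request: labels from the general result become reference keywords.
        keywords = [label for label, _ in suggestions]
    for keyword in keywords:
        suggestions.extend(specialized(image, keyword))   # step 800
    return keywords, suggestions


# Placeholder recognizers for the example from the text (assumed outputs).
def stub_general(image: object) -> List[Suggestion]:
    return [("car on a tow truck", 0.8)]


def stub_specialized(image: object, keyword: str) -> List[Suggestion]:
    return [("model of " + keyword, 0.5)]
```

Called with an empty keyword list, the sketch first obtains "car on a tow truck" from the general recognizer and then uses it as the reference keyword for the specialized request, without any further input from the user.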

Requesting 900 the general image recognition process for the image and/or requesting 800 a specialized object recognition process for the at least one reference keyword are carried out externally, the image recognition module 3 shown by way of example in FIG. 1 being a remote server or a cloud server, preferably using a database, preferably based on neural network technology.

Supplying data from an external source ensures that the most recent data can always be retrieved. For example, the opening hours of a point of interest may change, or a car may be a new model. These data can preferably be processed on a remote server or a cloud server. In particular, a remote server provides information, and this information can be evaluated in the vehicle. This can be advantageous in areas with poor reception, because it keeps the data volume as low as possible and thus ensures a short response time for the system. A cloud server, by contrast, is configured such that the actual data analysis also takes place on the cloud server. This reduces the computing capacity required in the car. It should be borne in mind that a car normally has a considerably longer service life than a computing device. Offloading the computation to a cloud server prevents a car that has been in operation for several years, or its computing unit, from being too old or too slow for the required data stream.

Neural network technology has the advantage that it can adapt. For example, the contours of cars can change with fashion. With a recurrent analysis, neural network technology can identify such contours as vehicle contours more quickly.

It is possible that requesting the general image recognition process for the image (step 900) is carried out primarily in the vehicle and externally only when there is further demand, whereas requesting a specialized object recognition process for the at least one reference keyword (step 800) is always carried out externally.

This can be a sensible distribution of resources, in that the step of requesting the general image recognition process for the image (step 900) can be used for an initial query about simple terms or objects. For example, a car can be identified relatively easily from certain patterns, whereas the requested car model may have been manufactured after the year of construction of the ego vehicle, so that the step of requesting a specialized object recognition process for the at least one reference keyword (step 800) should be handled by an external database that is fully up to date. Instead of a car, other examples of database data are brands or logos, buildings, or particular architectural styles. Other equivalent solutions are also contemplated.
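The split between in-vehicle and external processing could be sketched as below. The function names are hypothetical, and "further demand" is interpreted here, as an assumption, as the in-vehicle recognizer returning no result; the application does not define the criterion.

```python
from typing import Callable, Optional

# Hypothetical callables standing in for the in-vehicle and external
# recognizers; neither signature comes from the source.
Recognizer = Callable[[bytes], Optional[str]]
KeywordRecognizer = Callable[[bytes, str], Optional[str]]


def request_general_recognition(
    image: bytes,
    in_vehicle: Recognizer,
    external: Recognizer,
) -> Optional[str]:
    """Step 900: try the vehicle first, go external only if needed."""
    result = in_vehicle(image)
    # Assumption: "further demand" means the local attempt found nothing.
    if result is None:
        result = external(image)
    return result


def request_specialized_recognition(
    image: bytes,
    keyword: str,
    external: KeywordRecognizer,
) -> Optional[str]:
    """Step 800: always delegated to the external, up-to-date service."""
    return external(image, keyword)
```

A different fallback criterion, such as a confidence threshold, would fit the same structure by changing only the condition in `request_general_recognition`.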

According to a modified embodiment of the invention, as shown in Fig. 1, the user request is a voice-based user request.

This can increase driving safety because, for example, the driver can keep his hands on the steering wheel at all times. It also makes operation of the entire system intuitive.

In addition or as an alternative to the embodiment of the invention shown in Fig. 1, the user request is a gesture-based user request. This can be advantageous if, for example, a hands-free call is in progress in the vehicle. For example, the user might wish to receive information about the surroundings for orientation purposes. Without a gesture-based request the call would have to be interrupted, which the proposed measure makes unnecessary.

In other words, the system can be configured to capture voice-based or gesture-based user requests.

The invention also relates to a system for applying the above-mentioned method according to one of the above measures, having a device according to one of the above features. Such a system gives the user a safe driving experience, while precise information can be made available to the user at any time on demand.

According to a modified embodiment of the invention, the interaction module 2 has an eye tracker and/or a gesture tracker. This enables an analysis of the user based on his gaze and/or his gestures. The user therefore does not need to make a great effort to pose his requests and can concentrate better on the road traffic.

According to a modified embodiment of the invention, the assistant module 1 is a voice assistant module. This can increase driving safety because, for example, the driver can keep his hands on the steering wheel at all times. It also makes operation of the entire system intuitive.

According to a modified embodiment of the invention, the assistant module 1 is a gesture assistant module.

This can be advantageous if, for example, a hands-free call is in progress in the vehicle. For example, the user may wish to receive information about the surroundings for orientation purposes. Without a gesture-based request the call would have to be interrupted, which the proposed measure makes unnecessary.

According to a modified embodiment of the invention, an internal storage unit of a vehicle is used to retrieve information offline, this information including in particular points of interest. This can be advantageous in areas with poor reception, because it keeps the data volume as low as possible and thus ensures a short response time for the system. Optionally, the internal storage can be updated so that current information is still available after a longer period of time.
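One way to picture the internal storage unit is a small local store that is consulted first and refreshed when a connection is available. The class, the function, and the local-first lookup order below are illustrative assumptions; the application only states that information is retrieved offline and that the storage can optionally be updated.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Optional


@dataclass
class OfflinePoiStore:
    """Hypothetical in-vehicle store mapping a point of interest to info."""

    entries: Dict[str, str] = field(default_factory=dict)

    def lookup(self, name: str) -> Optional[str]:
        return self.entries.get(name)

    def update(self, fresh: Dict[str, str]) -> None:
        # Optional refresh so stored information stays current.
        self.entries.update(fresh)


def poi_info(
    name: str,
    store: OfflinePoiStore,
    fetch_online: Optional[Callable[[str], Optional[str]]] = None,
) -> Optional[str]:
    """Answer from the local store; go online only if the entry is missing."""
    cached = store.lookup(name)
    if cached is not None:
        return cached
    if fetch_online is None:  # assumed to mean: no reception
        return None
    info = fetch_online(name)
    if info is not None:
        store.update({name: info})
    return info
```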

According to a modified embodiment of the invention, the field of view of the front camera of the vehicle is supplemented by a sensor system using radar, LIDAR, ultrasound, stereo vision and/or infrared technology. This increases the measurement accuracy. For example, at night it can be difficult to evaluate an image that was recorded by a camera alone. Other sensor systems can help here.

List of reference signs

1: Assistant module
2: Gaze-controlled interaction module
3: Image recognition module
100: Capturing a user request
200: Checking whether more information is required
300: Processing the user request
400: Outputting information
500: Capturing the gaze direction of the user and retrieving information about an object at an intersection of the gaze vector with at least the field of view of the front camera of a vehicle with respect to the time of the user request
600: Cropping an image of the field of view of the front camera of the vehicle
600a: Cropping an image section of a standard size
600b: Cropping an image section whose size is calculated from the apparent distance and the apparent size of the object at the intersection
700: Extracting information of the at least one reference keyword
800: Requesting a specialized object recognition process for the at least one reference keyword
900: Requesting a general image recognition process for the image
1000: Merging any image-related suggestions
1100: Adding the new context information
F: False
T: True
t: Point in time at which a user request is captured
G: Gaze direction of the user
P: Image-related suggestion
NP: No image-related suggestion
O1: Object 1
O2: Object 2
O3: Object 3
DO: Dashboard object
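The two cropping variants 600a and 600b listed above could be sketched as follows. The application does not give a formula for the crop size, so the pinhole-camera relation, the focal length, the margin factor, and the default size used here are all assumptions chosen purely for illustration.

```python
from typing import Optional, Tuple

STANDARD_SIZE_PX = 256  # assumed default side length for step 600a


def crop_box(
    gaze_xy: Tuple[int, int],
    image_wh: Tuple[int, int],
    object_size_m: Optional[float] = None,
    object_distance_m: Optional[float] = None,
    focal_px: float = 1000.0,  # assumed focal length in pixels
    margin: float = 1.5,  # assumed padding around the object
) -> Tuple[int, int, int, int]:
    """Return (left, top, right, bottom) of a square crop around the gaze."""
    width, height = image_wh
    if object_size_m is None or object_distance_m is None or object_distance_m <= 0:
        # Step 600a: object retrieval failed (F), use the standard size.
        side = STANDARD_SIZE_PX
    else:
        # Step 600b: size from apparent distance and apparent size.
        # Assumed pinhole model: pixels = focal * size / distance.
        side = int(round(margin * focal_px * object_size_m / object_distance_m))
    side = max(1, min(side, width, height))
    half = side // 2
    # Shift the box so that it stays inside the camera image.
    left = min(max(gaze_xy[0] - half, 0), width - side)
    top = min(max(gaze_xy[1] - half, 0), height - side)
    return left, top, left + side, top + side
```

With these assumed defaults, a 2 m wide object at 20 m gives a 150-pixel crop, while a failed object lookup falls back to the 256-pixel standard size.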

CITATIONS INCLUDED IN THE DESCRIPTION

This list of the documents cited by the applicant was generated automatically and is included solely for the better information of the reader. The list is not part of the German patent or utility model application. The DPMA assumes no liability for any errors or omissions.

Patent literature cited

DE 102016221983 A1 [0004]

Claims

Method for answering a user request that relates to an object (O1, O2, O3) or an area outside the vehicle, comprising the following steps: - capturing a user request (100) by an assistant module (1) at a point in time (t); - checking whether more context information relating to the requested object or area is required than the existing information provides (200), in order to obtain a check result; and, if the check result is true (T), checking whether the user request is an open or a partially open request, an open request having no reference keyword and a partially open request having at least one reference keyword; - in the case of an open request or a partially open request, capturing the gaze direction (G) of the user and retrieving information about an object (O1) at the intersection of the gaze vector with the field of view of the front camera of a vehicle with respect to the point in time (t) (500) by a gaze-controlled interaction module (2), and cropping an image of the field of view of the front camera of the vehicle (600) by the gaze-controlled interaction module (2); - requesting a general image recognition process for the image (900) by an image recognition module (3) after cropping an image of the field of view of the front camera of the vehicle (600); characterized by, in the case of a partially open request, extracting information of the at least one reference keyword (700) in order to request the general image recognition process for the image (900) by the image recognition module (3).

Method according to claim 1, characterized by - merging any image-related suggestions (P) (1000) if an image-related suggestion (P) is received from any recognition process (800, 900); - after merging any image-related suggestions (1000) into new context information: adding the new context information (1100) relating to the user request to the existing information by the assistant module (1), whereupon the user request is processed (300) and information is output (400); and - processing the user request (300) and outputting information (400) if no image-related suggestion is received (NP) after the general image recognition process (900) has been requested.

Method according to claim 1 or 2, characterized in that, in the case of an open request, cropping an image of the field of view of the front camera of the vehicle (600) has two options, namely: - if retrieving information about the object (O1) at the intersection of the gaze vector with at least the field of view of the front camera of the vehicle with respect to the point in time (t) (500) is unsuccessful (F), an image section of a standard size is cropped (600a); - if retrieving information about the object (O1) at the intersection of the gaze vector with at least the field of view of the front camera of the vehicle with respect to the point in time (t) (500) is successful (T), an image section is cropped whose size is calculated from the apparent distance and the apparent size of the object (O1) at the intersection (600b); - wherein, after cropping an image section of the standard size (600a) or after cropping an image section whose size is calculated from the apparent distance and the apparent size of the object (O1) at the intersection (600b), the general image recognition process for the image (900) is requested by the image recognition module (3).

Method according to one of the preceding claims, characterized in that, in the case of a partially open request, cropping an image of the field of view of the front camera of the vehicle (600) has two options, namely: - if retrieving information about the object (O1) at the intersection of the gaze vector with at least the field of view of the front camera of the vehicle with respect to the point in time (t) (500) is unsuccessful (F), an image section of a standard size is cropped (600a); - if retrieving information about the object (O1) at the intersection of the gaze vector with at least the field of view of the front camera of the vehicle with respect to the point in time (t) (500) is successful (T), an image section is cropped whose size is calculated from the apparent distance and the apparent size of the object (O1) at the intersection (600b); - wherein, after cropping an image section of the standard size (600a) or after cropping an image section whose size is calculated from the apparent distance and the apparent size of the object (O1) at the intersection (600b), the general image recognition process for the image (900) is requested by the image recognition module (3).

Method according to one of the preceding claims, characterized in that, in the case of a partially open request, - after cropping an image of the field of view of the front camera of the vehicle (600), a general image recognition process for the image is requested (900) by an image recognition module (3) and, where possible, a specialized object recognition process for the at least one reference keyword is requested (800); - wherein the extraction of information of the at least one reference keyword (700) is carried out in order to request the general image recognition process for the image (900) by the image recognition module (3) and, where possible, also to request a specialized object recognition process for the at least one reference keyword (800).

Method according to one of the preceding claims, characterized in that requesting the general image recognition process for the image (900) by the image recognition module (3) can yield at least one reference keyword, so that requesting a specialized object recognition process for the at least one reference keyword (800) is the next step.

Method according to one of the preceding claims, characterized in that requesting the general image recognition process for the image (900) and/or requesting a specialized object recognition process for the at least one reference keyword (800) are carried out externally, the image recognition module (3) being a remote server or a cloud server, or at least one such server being used.

Method according to one of the preceding claims, characterized in that requesting the general image recognition process for the image (900) is carried out primarily in the vehicle and externally only when there is further demand, whereas requesting a specialized object recognition process for the at least one reference keyword (800) is always carried out externally.

Method according to one of the preceding claims, characterized in that the user request is spoken by a user or is a gesture-based user request.

Computer program with program code instructions for carrying out the steps of the method according to one of claims 1 to 9.