Abstract
A FOrward Simulation for Situation Evaluation (FOSSE) approach for evaluating game situations is proposed in this paper. The FOSSE approach considers multiple future situations to quantitatively evaluate the current game situation. Since future situations are not available in real time during an ongoing game, they are generated by what we call forward simulation. The current game situation is then evaluated using the generated future situations as well as the current situation itself. First, we show through preliminary experiments that evaluation performance can be increased by using successive situations in time; in particular, we show that future information is more effective than past information. Then, we present the FOSSE approach, in which both the current and the future information of game situations are used to evaluate the current game situation and the future game situations are generated by forward simulation. Computational experiments are conducted to investigate the effectiveness of the proposed approach.
Keywords
- Evaluating situation
- Forward simulation
- Recurrent Neural Network
- Deep learning
- Time series data
- Soccer Simulation
1 Introduction
In sports, it is useful to perceive which team is superior during a game. If the game situation can be evaluated quantitatively, the degree of dominance of each team can be accurately grasped. Furthermore, such quantitative evaluation could serve as a guideline for switching strategies during a game and could be applied to automatic live broadcasting of sports. However, quantitative evaluation is difficult because game situations change dynamically. To address this problem, we employ a machine learning method for quantitative evaluation.
As the experimental environment of this research, we use the RoboCup Soccer Simulation 2D League [1]. As a metric for evaluating game situations, Pomas and Nakashima [2] proposed SituationScore, which represents the degree of dominance in a soccer game. This paper also uses SituationScore, with a minor modification, to evaluate the game situation.
In general, only the current situation is considered when evaluating the game situation. However, since the game progresses dynamically and the situation changes drastically, especially in soccer, it is difficult to capture the degree of dominance from a single situation alone. In this paper, we investigate the use of multiple situations to capture the degree of dominance in a game.
If future information were available during a game, it would be possible to evaluate the game situation with higher accuracy than with only the current and past information. However, such future information is not available during an ongoing game. To solve this problem, we propose the FOrward Simulation for Situation Evaluation (FOSSE) approach, in which a machine learning model generates future situations by simulation and the current situation is then evaluated by using the generated future situations as well as the current one itself.
The proposed FOSSE approach consists of two parts. The first part is forward simulation for generating the estimated future game situations. The other one is situation evaluation for producing the value of SituationScore from the time series of game situations. We employ a Recurrent Neural Network (RNN) as the simulation model and a Deep Neural Network (DNN) as the evaluation model.
In the following sections, we first show that the prediction accuracy of an evaluation model can be improved by using multiple-situation information compared with a single-situation model. Secondly, an experiment using actual data shows that future information is more helpful than past information for an evaluation model. Finally, we demonstrate the effectiveness of the proposed method based on the FOSSE approach through computational experiments.
It should be noted that the meaning of “evaluation” in this paper is to understand the field situation, such as the degree of domination by the currently attacking team and the likelihood of that team scoring. There is other research in which evaluation means the value of a state or an action for determining the next action of an individual player agent. Although the situation evaluation in this paper could also be used for such a purpose in the future, this is not the focus of this paper. We focus on the evaluation of a field situation not from the viewpoint of the soccer players, who can only see the situation in their visual area, but from the viewpoint of a coach or spectators who can watch the whole soccer field.
2 Quantitative Evaluation of Game Situations in RoboCup Soccer
We employ the RoboCup Soccer Simulation 2D League [1] as the subject of study in this paper. Commonly used measures of team dominance in a game are the identity of the ball-possessing team and the location of the ball on the soccer field. However, such simple indices cannot accurately capture the degree of team dominance. Therefore, an index that quantitatively expresses the game situation is required.
Pomas and Nakashima [2] proposed SituationScore, which represents the value of a game situation. In their work, a game situation is quantitatively evaluated by using the number of time cycles remaining until the next goal. This paper uses the same idea of SituationScore with a minor modification.
This section first introduces the RoboCup Soccer Simulation 2D League. Then, some modification to SituationScore is presented.
2.1 RoboCup Soccer Simulation 2D League
RoboCup [1] is a research project that focuses on the development of robotics and artificial intelligence. The project comprises various leagues, and the RoboCup Soccer Simulation 2D League is one of them. It does not use real soccer robots but simulated soccer players. The players are represented as two-dimensional circles, as shown in Fig. 1, and play soccer on a two-dimensional virtual soccer field that is set up on a computer. The positions of the players and the ball are represented as two-dimensional vectors. Each player is programmed as an independent agent, unlike in a video soccer game, where one central system controls all objects such as the players. A game consists of 6,000 time cycles, and one cycle corresponds to 0.1 s. When a game is over, a game log is generated that contains all the game information, such as the position coordinates of the players and the ball in each cycle.
2.2 Modification to SituationScore
Pomas and Nakashima [2] proposed a metric called SituationScore. This metric represents the value of a game situation; its value increases as the game situation gets closer to the time of goal scoring. In its original definition, the maximum value of SituationScore was 100 (when the left team scores), the minimum value was \(-100\) (when the right team scores), and the superiority and inferiority of the teams were considered for all time cycles. However, it is difficult to predict the value of SituationScore when it is close to zero, which corresponds to a boundary situation between the superiority and the inferiority of the teams. For this reason, some changes were made to SituationScore in this paper so that the lower limit is set to 0, assuming that SituationScore represents only the degree of dominance of one team. Also, because it was difficult to correctly predict the value of situations far from a goal, we only consider situations in which a goal is scored within 50 time cycles. As a result, in this paper, we slightly modify the definition of SituationScore as follows:
\(SituationScore(t) = 50 - n\)  (1)
where \(n\) represents the number of remaining cycles from t until the next score. In this paper, the range of SituationScore is \(0 \le SituationScore \le +50\), which means that we only consider goals by the left team. The value of SituationScore for the right team can be defined separately by switching the sign of the value (i.e., positive to negative). Figure 2 shows an example game situation nine time cycles before the left team scores a goal, along with its SituationScore.
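As a small illustration of (1), the label computation can be written as the following Python function. This is only a sketch: the goal cycle in the example is hypothetical, and the exact handling of the scoring cycle itself is an assumption.

```python
def situation_score(t, next_goal_cycle):
    """Modified SituationScore at time cycle t, given the cycle at which
    the left team scores next. Cycles more than 50 cycles away from the
    next goal are not used and return None."""
    n = next_goal_cycle - t            # remaining cycles until the next score
    if 0 <= n <= 50:                   # boundary handling is an assumption
        return 50 - n
    return None

# Hypothetical example: the left team scores at cycle 1200.
# Nine cycles before the goal (as in Fig. 2), SituationScore is 50 - 9 = 41.
assert situation_score(1191, 1200) == 41
assert situation_score(1100, 1200) is None   # more than 50 cycles before the goal
```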
2.3 Dataset
The dataset in the computational experiments in this paper was generated by the following steps:
1. A game between HELIOS2018 [3] and agent2d [4] is played a specified number of times.
2. The log files of the games are analyzed with Python scripts to detect the cycles at which goals were scored.
3. The numerical information of the soccer field for the 50 time cycles before each goal of the left team (i.e., HELIOS2018 [3]) is recorded together with the corresponding SituationScore values, and the recorded information is saved in a file for each time cycle. The numerical information includes the positions of the 22 players and the ball. The value of SituationScore is calculated as in (1) and is used as the ground truth for the situation evaluation.
A dataset containing the numerical field information for 394,350 time cycles (7,887 goal sequences of 50 cycles each) was constructed from 1,000 games. This dataset was then split into three parts: training data (\(5,490 \times 50\) time cycles), validation data (\(788 \times 50\) time cycles), and test data (\(1,609 \times 50\) time cycles). We use this dataset for all experiments in the rest of this paper.
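A minimal sketch of the dataset construction described above is given below, assuming hypothetical helpers (`left_goal_cycles`, `field_state`) that stand in for the log-parsing Python scripts of step 2; the exact window boundary relative to the scoring cycle is an assumption.

```python
import numpy as np

def build_dataset(games):
    """Collect (field state, SituationScore) pairs for the 50 cycles
    preceding each goal by the left team.

    `games` is assumed to be an iterable of parsed game logs providing
    two hypothetical helpers:
      game.left_goal_cycles() -> cycles at which the left team scored
      game.field_state(t)     -> flat feature vector for cycle t
                                 (ball position and 22 player positions)
    """
    states, scores = [], []
    for game in games:
        for goal_cycle in game.left_goal_cycles():
            for n in range(1, 51):        # the 50 cycles before the goal
                states.append(game.field_state(goal_cycle - n))
                scores.append(50 - n)     # SituationScore as in (1)
    return (np.asarray(states, dtype=np.float32),
            np.asarray(scores, dtype=np.float32))
```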
3 Situation Evaluation with Multiple Situations
3.1 Evaluation Model
This section investigates the effect of using multiple situations on the accuracy of the trained model for situation evaluation. We employ a simple DNN as the evaluation model of game situations. This model produces the value of SituationScore at time cycle t. The overview of the DNN model is shown in Fig. 3. In this figure, \(\varvec{X}\) denotes the information of a game situation, such as the positions of the players and the ball. \(\varvec{X}_t\) is the information of the current game state (time cycle t), \(\varvec{X}_{t-n_p}\) is past state information (i.e., \(n_p\) time cycles before the current time cycle), and \(\varvec{X}_{t+n_f}\) is future state information (i.e., \(n_f\) time cycles after the current time cycle).
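As a concrete illustration of the model in Fig. 3, the following PyTorch sketch builds a fully-connected DNN that takes several situations concatenated into one input vector and outputs a scalar SituationScore. This is a minimal sketch rather than the authors' implementation; the ReLU activation and the concatenation of the input situations are assumptions, while the layer sizes follow the settings given in Sect. 3.2.

```python
import torch
import torch.nn as nn

class SituationEvaluator(nn.Module):
    """Fully-connected DNN mapping a window of game situations to SituationScore."""

    def __init__(self, situation_dim=46, num_situations=1,
                 num_hidden_layers=20, hidden_units=16):
        super().__init__()
        layers = [nn.Linear(situation_dim * num_situations, hidden_units), nn.ReLU()]
        for _ in range(num_hidden_layers - 1):
            layers += [nn.Linear(hidden_units, hidden_units), nn.ReLU()]
        layers.append(nn.Linear(hidden_units, 1))        # scalar SituationScore
        self.net = nn.Sequential(*layers)

    def forward(self, situations):
        # situations: (batch, num_situations, situation_dim)
        return self.net(situations.flatten(start_dim=1)).squeeze(-1)

# Example: Model 4 with the current situation and five future situations as input.
model = SituationEvaluator(situation_dim=46, num_situations=6)
```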
Numerical experiments are conducted in the next subsection in order to evaluate the performance of the trained model with various combinations of input game situations.
3.2 Experiment
Experimental Settings. The purpose of the experiments in this section is to examine the usefulness of using field information from multiple successive time cycles for evaluating the field situation (i.e., predicting the value of SituationScore). We compare the following four models with different combinations of game situations as the input of the DNN.
- Model 1: Single situation (only the current game situation)
- Model 2: Multiple situations (the current, past, and future game situations)
- Model 3: Multiple situations (the current and past game situations)
- Model 4: Multiple situations (the current and future game situations)
The architectures are shown in Figs. 4, 5, 6 and 7. The number of hidden layers is fixed to 20 for all models, each hidden layer has 16 units, and the layers are fully-connected. For the training of the DNNs, we set the batch size to 64 and used the Adam optimizer [5] with an initial learning rate of 0.001, \(\beta_1 = 0.9\), and \(\beta_2 = 0.999\). Table 1 summarizes the experimental settings. The dimensionality of the input data for each situation is one of the following three: two (the \(x-y\) coordinates of the ball position), 24 (the \(x-y\) coordinates of the ball position and the left team's player positions), or 46 (the \(x-y\) coordinates of the ball position and all players' positions). Past and future information of 5 time cycles is used for Models 2, 3, and 4.
We use Mean Absolute Error (MAE) as the quality measure of the trained model’s accuracy.
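Under the stated settings (batch size 64, Adam with an initial learning rate of 0.001, \(\beta_1 = 0.9\), \(\beta_2 = 0.999\), and MAE as the quality measure), training and evaluation might look like the sketch below. Using MAE (L1 loss) as the training objective and the number of epochs are assumptions.

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

def train(model, states, scores, epochs=50):
    """states: (N, num_situations, situation_dim) tensor; scores: (N,) tensor."""
    loader = DataLoader(TensorDataset(states, scores), batch_size=64, shuffle=True)
    optimizer = torch.optim.Adam(model.parameters(), lr=0.001, betas=(0.9, 0.999))
    loss_fn = torch.nn.L1Loss()                     # MAE
    for _ in range(epochs):
        for x, y in loader:
            optimizer.zero_grad()
            loss_fn(model(x), y).backward()
            optimizer.step()

@torch.no_grad()
def mean_absolute_error(model, states, scores):
    """MAE between the predicted and ground-truth SituationScore."""
    return torch.mean(torch.abs(model(states) - scores)).item()
```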
Results. The experimental results are shown in Table 2. They show the effectiveness of using multiple situations compared with a single situation: Model 1, which uses only a single situation (i.e., the current game situation) as input, produced the largest MAE for all experimental settings. This suggests that the dominance trend of a dynamically changing game is better captured by using multiple situations.
In addition, the results show the effectiveness of using future information: future information is more effective than past information for evaluating the game situation. For situation evaluation, it is more important to consider how the game is going to develop from the current situation than how it developed up to the current situation.
The experimental results thus demonstrate the advantage of using multiple situations that include future information. However, using future information as the model's input poses a problem: future information is not available in real time during an ongoing game. If there were a way to obtain future information, it would be helpful for situation evaluation. The next section describes the proposed method, which solves this problem.
4 FOSSE Approach for Evaluating Field Situation
4.1 FOSSE Approach
In the previous section, it was shown that using multiple past and future situations helps enhance the performance of the trained model for situation evaluation. In particular, using future situations produced the best accuracy among the four considered models. There is, however, a problem for real-time application: the future information is not available during an ongoing game. To solve this problem, we propose the FOrward Simulation for Situation Evaluation (FOSSE) approach. Figure 8 shows the overview of the FOSSE approach, which consists of two parts: a forward simulation part and a situation evaluation part. The forward simulation part generates estimates of future information from the current and past game situations. Using the generated future information as well as the past and current field information, the situation evaluation part produces the value of SituationScore at time cycle t. The following subsections explain each part of the FOSSE approach.
In this section, we first explain the forward simulation in detail. Secondly, we explain the method for evaluating situations with the FOSSE approach. Finally, computational experiments are conducted to show the effectiveness of the proposed method.
4.2 Forward Simulation
The forward simulation part is shown in Fig. 9. The forward simulation takes the past situations as input and generates the estimated field situation of the future.
Recurrent Neural Network. An RNN is a type of neural network that deals with time-series data through iterative application. It takes the output vector from the previous RNN block at time \(t-1\) and the game situation at time cycle t as input, and its output vector is used as the input of the next RNN block at time \(t+1\). The future game situations are simulated through this process, which we call forward simulation for predicting the field situation of future time cycles (i.e., the future game situations). The process is shown in Fig. 10, which illustrates generating the future game situation at time cycle \(t+1\) from a time series of previous game situations from time cycle \(t-n\) to time cycle t. Each piece of information in the time series \(\{\varvec{X}_{t-n},\ldots ,\varvec{X}_t\}\) is processed by the same block, which is generally a hidden layer of the RNN. After the last piece of the time series is processed by the block, the estimated next situation is generated through a fully-connected layer (FC).
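The recurrence described above can be written schematically as follows; `rnn_block` and `fc` are hypothetical placeholders for the shared hidden-layer block and the fully-connected output layer of Fig. 10.

```python
def predict_next_situation(rnn_block, fc, situations, initial_hidden):
    """Process the time series X_{t-n}, ..., X_t with one shared RNN block and
    map the final hidden state to the estimated next situation X_{t+1}'."""
    hidden = initial_hidden
    for x in situations:               # X_{t-n}, ..., X_t in temporal order
        hidden = rnn_block(x, hidden)  # the same block is reused at every step
    return fc(hidden)                  # estimated next situation X_{t+1}'
```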
Related Work. Several works are related to forward simulation using RNNs. Khosroshahi et al. [6] realized trajectory prediction of surrounding vehicles, looking ahead for the automated driving of vehicles. In the field of health care, Choi et al. [7] presented a work in which physicians' diagnoses and dosing orders for patients were predicted by RNNs using the vast amount of time-series data obtained from electronic medical records.
Shi et al. [8] used Long Short-Term Memory (LSTM) [9], an extended version of the RNN, for flight trajectory prediction. Although this task seems more difficult than simple vehicle trajectory prediction, a high prediction performance was demonstrated. Alahi et al. [10] tackled the problem of tracking people in a crowd with an LSTM by introducing social pooling, which shares information among neighboring persons.
These related works indicate that RNNs can successfully predict future situations from time-series data and that the LSTM is effective even in difficult tasks. Based on these observations, this paper also employs the LSTM as the RNN architecture for the forward simulation part (i.e., we use the LSTM as the iterative block in Fig. 10).
Experiment. In the computational experiments of this subsection, we investigate the accuracy of the forward simulation using the LSTM. Specifically, we investigate the prediction accuracy of the future game situations that are generated by iteratively applying the trained LSTM. For training the LSTM model, we set the batch size to 512 and used the Adam optimizer [5] with an initial learning rate of 0.001, \(\beta_1 = 0.9\), and \(\beta_2 = 0.999\). At each step, the LSTM takes a single situation and the output vector from the previous LSTM block as input and generates a 512-dimensional output vector, which is in turn used as part of the input to the next LSTM block.
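A concrete instantiation of this single-step prediction model, assuming PyTorch, could look as follows; the 512-dimensional hidden state matches the setting above, while everything else is a sketch rather than the authors' implementation.

```python
import torch
import torch.nn as nn

class ForwardSimulator(nn.Module):
    """LSTM block of Fig. 10: consumes a time series of situations and
    predicts the next situation through a fully-connected (FC) layer."""

    def __init__(self, situation_dim=46, hidden_dim=512):
        super().__init__()
        self.lstm = nn.LSTM(situation_dim, hidden_dim, batch_first=True)
        self.fc = nn.Linear(hidden_dim, situation_dim)

    def forward(self, history):
        # history: (batch, sequence_length, situation_dim), i.e. X_{t-n}, ..., X_t
        outputs, _ = self.lstm(history)
        last = outputs[:, -1, :]       # output after the last element of the series
        return self.fc(last)           # estimated next situation X_{t+1}'
```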
In the computational experiments, the number of future field situations generated by the forward simulation is set equal to the number of past situations in the input time series. Figure 11 shows this procedure. For example, in the case of four past situations, the four past situations \(\varvec{X}_{t-4}\), \(\varvec{X}_{t-3}\), \(\varvec{X}_{t-2}\), \(\varvec{X}_{t-1}\) and the current situation \(\varvec{X}_t\) are first given as input to the model in order to generate the estimated next situation \(\varvec{X}_{t+1}'\), where a fully-connected layer (FC) produces \(\varvec{X}_{t+1}'\) after the last piece of the input time series is processed. Then, \(\varvec{X}_{t+2}'\) is predicted from the five situations \(\varvec{X}_{t-3}\), \(\varvec{X}_{t-2}\), \(\varvec{X}_{t-1}\), \(\varvec{X}_{t}\), and \(\varvec{X}_{t+1}'\) (i.e., the value predicted in the previous iteration). This procedure is repeated four times to finally generate the estimated future game situation \(\varvec{X}_{t+4}'\). The error between the last prediction \(\varvec{X}_{t+4}'\) and the actual value is investigated. Each model is evaluated based on the MAE between the model's output and the ground truth of each of the corresponding objects' positions.
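The iterative procedure of Fig. 11 then amounts to an autoregressive rollout: each prediction is appended to the input window, the oldest situation is dropped, and the model is applied again. The sketch below reuses the hypothetical `ForwardSimulator` above.

```python
import torch

@torch.no_grad()
def forward_simulate(simulator, history, num_future):
    """Generate the estimated future situations X_{t+1}', ..., X_{t+num_future}'.

    history: (batch, window, situation_dim) tensor holding the observed
             situations X_{t-window+1}, ..., X_t.
    """
    window = history
    predictions = []
    for _ in range(num_future):
        next_situation = simulator(window)                  # X_{t+k}'
        predictions.append(next_situation)
        # Slide the window: drop the oldest situation, append the prediction.
        window = torch.cat([window[:, 1:, :], next_situation.unsqueeze(1)], dim=1)
    return torch.stack(predictions, dim=1)                  # (batch, num_future, dim)
```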
Table 3 shows the results of the experiment. The prediction over three situations has less error than that over five situations. As expected, the results indicate that prediction becomes more difficult as the number of situations increases, since predicted values are repeatedly fed back as input in place of actual values.
4.3 Evaluation of Game Situations by FOSSE Approach
In Sect. 3, it was shown that the DNN architecture that uses future information produced the best situation evaluation among the four investigated models. Thus, we employ that type of DNN as the evaluation model. Since the future information is not available during a game, we estimate it by the forward simulation described in Sect. 4.2. Moreover, based on the results of the computational experiments in Sect. 4.2, the LSTM is employed as the forward simulation part in our FOSSE architecture. The overview of the resulting FOSSE architecture employed in this paper is shown in Fig. 12. In this architecture, a field situation is evaluated by using the predicted future information (i.e., \(\varvec{X}_{t+1}',\ldots ,\varvec{X}_{t+n_f}'\)) generated by forward simulation. The DNN model and the RNN model are constructed separately. In the forward simulation, the LSTM model predicts the field situation of the next time cycle, as shown in the bottom part of Fig. 12. Then, in the situation evaluation part, the SituationScore of the current game situation is estimated using the current game situation as well as the estimated future game situations generated by the forward simulation.
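Putting the two parts together, the FOSSE architecture of Fig. 12 can be sketched as follows, reusing the hypothetical `ForwardSimulator`, `forward_simulate`, and `SituationEvaluator` from the earlier sketches; how the observed and predicted situations are ordered in the evaluator's input is an assumption.

```python
import torch

@torch.no_grad()
def fosse_evaluate(simulator, evaluator, history, num_future=5):
    """Estimate SituationScore at the current cycle t during an ongoing game.

    history: (batch, window, situation_dim) tensor of observed situations
             ending with the current situation X_t. The evaluator is assumed
             to have been built with num_situations = 1 + num_future.
    """
    # Forward simulation part: generate X_{t+1}', ..., X_{t+num_future}'.
    future = forward_simulate(simulator, history, num_future)
    # Situation evaluation part: current situation plus predicted future situations.
    evaluator_input = torch.cat([history[:, -1:, :], future], dim=1)
    return evaluator(evaluator_input)        # predicted SituationScore(t)
```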
4.4 Experiment
Experimental Settings. Table 4 shows the settings of the models used in the computational experiment in this section. We compare the accuracy of four trained DNN models using a single situation, multiple past situations, multiple (actual) future situations, and multiple predicted future situations (i.e., the proposed method).
Results. Table 5 shows the results of the experiments. The proposed method achieves higher accuracy than using only a single field situation. Moreover, the performance of the proposed method is better than using multiple past situations when the field situations of three successive time cycles are used as input. This shows that using future situations predicted by the simulation model is more useful for situation evaluation than using already-known past situations. Although using multiple actual future situations leads to a high accuracy of the trained model, this is only an ideal reference because future information is not available at the current time; it is therefore not a realistic option for model building in real-time games. In contrast, the proposed method can be used during ongoing games because the field situations of future time cycles are generated by forward simulation.
When the field situations of five successive time cycles are used, the proposed method still outperforms the single-situation model, but it shows no advantage over the model using past situations. This is considered to be due to the fact that the error of the forward simulation model's output increases as the number of situations increases, as described in Sect. 4.2. Although this paper employs a simple simulation model, it can be expected that the accuracy will be improved by elaborating the forward simulation model further. The improvement of the forward simulation model is left for our future work.
The results of the computational experiments show the effectiveness of the proposed method, which evaluates the situation in combination with forward simulation. Accurate evaluation of game situations is important for victory in many sports, not just soccer, and other sports could also benefit from the FOSSE approach for evaluating game situations.
5 Conclusion
In this paper, we proposed the FOSSE approach for evaluating game situations in the RoboCup Soccer Simulation 2D League. Three contributions to evaluating a game situation were presented. The first contribution is to show the effectiveness of using the field situations of multiple time cycles rather than only a single situation. The second contribution is to show that future information is more valuable than past information. The third and main contribution is to propose the FOSSE approach, in which simulated future information is generated by forward simulation. The FOSSE approach consists of two parts: a forward simulation part and a situation evaluation part.
In our FOSSE approach, a DNN with multiple future situations was used as the situation evaluation part, and an LSTM was used for the forward simulation part. The computational experiments showed the effectiveness of our model. This allows us to evaluate the game situation in real time during an ongoing game. It is expected that the FOSSE approach can be applied to sports other than soccer, such as rugby and basketball.
The idea of this approach is similar to human thinking processes. People often unconsciously perform forward simulation when evaluating situations in real life. When humans estimate SituationScore in a certain situation, they possibly consider not only the current game situation but also expected future game situations. If the proposed method is shown to follow the same process as human thought, reproducing human thinking processes with machine learning methods would be considered effective.
6 Future Work
This paper conducted the computational experiments with only two teams; that is, only two teams were involved in the generation of the training and test datasets. Considering practical applications in which various teams are involved in a tournament, it is necessary to show that the proposed method works in general for other teams. In future work, we will investigate the generalization ability of the proposed method, that is, the performance of the trained model on unknown teams that are not included in the generation process of the training dataset.
Furthermore, as already mentioned in the experiments of Sect. 4.4, it is necessary to improve the prediction accuracy of the forward simulation. For instance, different architectures of the forward simulation model and the situation evaluation model can be used by increasing the number of hidden layers or by changing the number of situations used as input. Another idea is to adapt the FOSSE approach to accommodate field image data, because it was indicated in [2] that using image data could lead to better accuracy in evaluating game situations. In addition, we will consider machine learning methods that computationally realize human thinking processes. Incorporating human thought processes into machine learning methods has the potential to contribute to the development of artificial intelligence.
Ultimately, we would like to implement the proposed method in a RoboCup soccer team and apply it as an indicator for tactical switching during a game. In addition, we would like to apply it to enhancing the game-watching experience, independently of the implementation of a team.
References
1. Kitano, H., Asada, M., Kuniyoshi, Y., Noda, I., Osawa, E., Matsubara, H.: RoboCup: a challenge problem for AI. AI Mag. 18(1), 73–85 (1997)
2. Pomas, T., Nakashima, T.: Evaluation of situations in RoboCup 2D simulations using soccer field images. In: Holz, D., Genter, K., Saad, M., von Stryk, O. (eds.) RoboCup 2018. LNCS (LNAI), vol. 11374, pp. 275–286. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-27544-0_23
3. Akiyama, H., Nakashima, T., Suzuki, Y., Ohori, A., Fukushima, T.: HELIOS2018: team description paper. In: RoboCup 2018, Montreal, p. 6 (2018)
4. Akiyama, H., Nakashima, T.: HELIOS base: an open source package for the RoboCup soccer 2D simulation. In: Behnke, S., Veloso, M., Visser, A., Xiong, R. (eds.) RoboCup 2013. LNCS (LNAI), vol. 8371, pp. 528–535. Springer, Heidelberg (2014). https://doi.org/10.1007/978-3-662-44468-9_46
5. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: Proceedings of the International Conference on Learning Representations (2015)
6. Khosroshahi, A., Ohn-Bar, E., Trivedi, M.M.: Surround vehicles trajectory analysis with recurrent neural networks. In: Proceedings of the IEEE 19th International Conference on Intelligent Transportation Systems (ITSC), pp. 2267–2271 (2016)
7. Choi, E., Bahadori, M.T., Schuetz, A., Stewart, W.F., Sun, J.: Doctor AI: predicting clinical events via recurrent neural networks. In: Proceedings of Machine Learning for Healthcare 2016, pp. 301–318 (2016)
8. Shi, Z., Xu, M., Pan, Q., Yan, B., Zhang, H.: LSTM-based flight trajectory prediction. In: Proceedings of the 2018 International Joint Conference on Neural Networks (IJCNN), pp. 1–8 (2018)
9. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
10. Alahi, A., Goel, K., Ramanathan, V., Robicquet, A., Li, F.-F., Savarese, S.: Social LSTM: human trajectory prediction in crowded spaces. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 961–971 (2016)