Title:
Information processing apparatus and information processing method
Document Type and Number:
Japanese Patent JP7494856
Kind Code:
B2
Abstract:
An information processing apparatus (100) includes an acquisition unit (153), a reception unit (151), and a display unit (156). The acquisition unit acquires a machine learning model trained by reinforcement learning, based on a plurality of rewards each weighted by its own weight, so that when first state information indicating a first state is input, the model outputs first action information indicating a first action corresponding to the first state. The reception unit receives training data, which is a set of second state information indicating a second state and second action information indicating a second action corresponding to the second state. The display unit displays information on the weight of each reward. These weights are estimated by training the machine learning model, with the weight of each reward defined as part of the model's connection coefficients, so that when the second state information in the training data and a value based on the weight of each reward are input, the model outputs the second action information in the training data.
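
The estimation step in the abstract can be illustrated with a toy sketch. It is not the patented implementation, and everything concrete in it is an assumption made for illustration: the frozen table `Q` stands in for the already trained model, the rewards are combined by a simple weighted sum, the weights are kept positive and summing to one through a softmax over trainable parameters `theta`, and the demonstration pairs in `DEMOS` are invented. Only `theta` is updated, so fitting the demonstrated actions amounts to estimating the reward weights.

```python
import math

# Hypothetical frozen stand-in for the pre-trained model: Q[k][s][a] is the value
# of action a in state s under reward k alone. All numbers are invented.
Q = [
    [[1.0, 0.0], [1.0, 0.0], [2.0, 0.0], [0.0, 1.0]],  # reward 0
    [[0.0, 1.0], [0.0, 3.0], [0.0, 1.0], [1.0, 0.0]],  # reward 1
]

# Hypothetical training data: (state, demonstrated action) pairs.
DEMOS = [(0, 0), (1, 1), (2, 0), (3, 1)]


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def action_probs(theta, state):
    """Return (reward weights, action probabilities) for one state."""
    w = softmax(theta)  # reward weights derived from the trainable parameters
    n_actions = len(Q[0][state])
    logits = [sum(w[k] * Q[k][state][a] for k in range(len(w)))
              for a in range(n_actions)]
    return w, softmax(logits)


def estimate_weights(demos, n_rewards=2, lr=0.5, steps=500):
    """Fit only theta by full-batch gradient descent on cross-entropy; Q stays frozen."""
    theta = [0.0] * n_rewards
    for _ in range(steps):
        grad = [0.0] * n_rewards
        for state, action in demos:
            w, p = action_probs(theta, state)
            # Gradient of the loss with respect to each reward weight w_k.
            g = [sum((p[a] - (1.0 if a == action else 0.0)) * Q[k][state][a]
                     for a in range(len(p)))
                 for k in range(n_rewards)]
            g_bar = sum(g[k] * w[k] for k in range(n_rewards))
            # Chain rule through the softmax that maps theta to the weights.
            for j in range(n_rewards):
                grad[j] += w[j] * (g[j] - g_bar) / len(demos)
        theta = [t - lr * d for t, d in zip(theta, grad)]
    return softmax(theta)


weights = estimate_weights(DEMOS)
for k, w in enumerate(weights):
    print(f"reward {k}: estimated weight {w:.2f}")
```

The final loop plays the role of the display unit in this sketch: it reports the estimated weight of each reward after fitting.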

Inventors:
Tomoya Kimura
Application Number:
JP2021552104A
Publication Date:
June 04, 2024
Filing Date:
July 14, 2020
Assignee:
Sony Group Corporation
International Classes:
G06N20/00
Domestic Patent References:
JP2018181343A
Foreign References:
WO2018110305A1
Attorney, Agent or Firm:
Sakai International Patent Office