<?xml version='1.0' encoding='UTF-8'?>
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd">
  <responseDate>2026-03-07T11:09:00Z</responseDate>
  <request identifier="oai:kougei.repo.nii.ac.jp:00002056" verb="GetRecord" metadataPrefix="jpcoar_2.0">https://kougei.repo.nii.ac.jp/oai</request>
  <GetRecord>
    <record>
      <header>
        <identifier>oai:kougei.repo.nii.ac.jp:00002056</identifier>
        <datestamp>2023-06-20T13:38:09Z</datestamp>
        <setSpec>12:17:266:270</setSpec>
      </header>
      <metadata>
        <jpcoar:jpcoar xmlns:datacite="https://schema.datacite.org/meta/kernel-4/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:dcndl="http://ndl.go.jp/dcndl/terms/" xmlns:dcterms="http://purl.org/dc/terms/" xmlns:jpcoar="https://github.com/JPCOAR/schema/blob/master/2.0/" xmlns:oaire="http://namespace.openaire.eu/schema/oaire/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:rioxxterms="http://www.rioxx.net/schema/v2.0/rioxxterms/" xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns="https://github.com/JPCOAR/schema/blob/master/2.0/" xsi:schemaLocation="https://github.com/JPCOAR/schema/blob/master/2.0/jpcoar_scm.xsd">
          <dc:title>強化学習の転移学習における転移率を用いた再利用方策学習進度の可逆性</dc:title>
          <dc:title xml:lang="en">Reversibility Validation of Learning Progression Using Transfer Rate in Transfer Reinforcement Learning</dc:title>
          <jpcoar:creator>
            <jpcoar:creatorName>河野, 仁</jpcoar:creatorName>
            <jpcoar:creatorName xml:lang="ja-Kana">コウノ, ヒトシ</jpcoar:creatorName>
          </jpcoar:creator>
          <jpcoar:creator>
            <jpcoar:creatorName>佐藤, 弘和</jpcoar:creatorName>
            <jpcoar:creatorName xml:lang="ja-Kana">サトウ, ヒロカズ</jpcoar:creatorName>
          </jpcoar:creator>
          <jpcoar:creator>
            <jpcoar:creatorName xml:lang="en">Kono, Hitoshi</jpcoar:creatorName>
          </jpcoar:creator>
          <jpcoar:creator>
            <jpcoar:creatorName xml:lang="en">Sato, Hirokazu</jpcoar:creatorName>
          </jpcoar:creator>
          <jpcoar:subject subjectScheme="Other">強化学習</jpcoar:subject>
          <jpcoar:subject subjectScheme="Other">転移学習</jpcoar:subject>
          <jpcoar:subject subjectScheme="Other">転移率</jpcoar:subject>
          <jpcoar:subject subjectScheme="Other">転移曲面</jpcoar:subject>
          <jpcoar:subject xml:lang="en" subjectScheme="Other">Reinforcement learning</jpcoar:subject>
          <jpcoar:subject xml:lang="en" subjectScheme="Other">Transfer learning</jpcoar:subject>
          <jpcoar:subject xml:lang="en" subjectScheme="Other">Transfer rate</jpcoar:subject>
          <jpcoar:subject xml:lang="en" subjectScheme="Other">Transfer surface</jpcoar:subject>
          <datacite:description descriptionType="Abstract">本論文では，強化学習における転移学習で使用される，転移率というパラメータの効果を検証する．転移率は，転移学習時の過学習状態を回避するために用いられるが，再利用方策の学習進度を疑似的にロールバックできると考えられている．本論文では実際の強化学習・転移学習シミュレーションから，学習進度をロールバックできるか効果を検証したので報告する．</datacite:description>
          <datacite:description descriptionType="Abstract">In recent years, the robot systems with learning algorithms are deployed in the real world situation,for example, automatic driving car, warehouse robots and so on. A reinforcement learning (RL) can be contributed for increasing of intelligence of the robot system, and RL do not need the supervised data.
Additionally, RL can explore the optimal solution by itself. However, the robot with reinforcement learning(called RL-agent) has probability to encounter with over fitting caused by reusing obtained policy. A transfer rate has been proposed to reduce the utilization of the policy. Moreover, the transfer rate is thought to have the effect of rolling back the learning progress of the policy to be reused. However, this effectiveness is not validated based on actual reinforcement learning and transfer learning. In this paper, the transfer rate is validated with transfer surface which is visual and quantitative evaluation method of transfer, and the transfer rate is verified the contribution for rolling back of learning progress of reusing policy for transfer learning.</datacite:description>
          <dc:publisher>東京工芸大学工学部</dc:publisher>
          <datacite:date dateType="Issued">2019-12-25</datacite:date>
          <dc:language>jpn</dc:language>
          <dc:type rdf:resource="http://purl.org/coar/resource_type/c_6501">departmental bulletin paper</dc:type>
          <jpcoar:identifier identifierType="URI">https://kougei.repo.nii.ac.jp/records/2056</jpcoar:identifier>
          <jpcoar:sourceIdentifier identifierType="ISSN">03876055</jpcoar:sourceIdentifier>
          <jpcoar:sourceTitle>東京工芸大学工学部紀要</jpcoar:sourceTitle>
          <jpcoar:sourceTitle xml:lang="en">The Academic Reports, the Faculty of Engineering, Tokyo Polytechnic University</jpcoar:sourceTitle>
          <jpcoar:volume>42</jpcoar:volume>
          <jpcoar:issue>1</jpcoar:issue>
          <jpcoar:pageStart>25</jpcoar:pageStart>
          <jpcoar:pageEnd>30</jpcoar:pageEnd>
          <jpcoar:file>
            <jpcoar:URI label="vol42-1-04">https://kougei.repo.nii.ac.jp/record/2056/files/vol42-1-04.pdf</jpcoar:URI>
            <jpcoar:mimeType>application/pdf</jpcoar:mimeType>
            <jpcoar:extent>4.4 MB</jpcoar:extent>
            <datacite:date dateType="Available">2019-12-25</datacite:date>
          </jpcoar:file>
        </jpcoar:jpcoar>
      </metadata>
    </record>
  </GetRecord>
</OAI-PMH>
