Place preference and vocal learning rely on distinct reinforcers in songbirds

Abstract In reinforcement learning (RL) agents are typically tasked with maximizing a single objective function such as reward. But it remains poorly understood how agents might pursue distinct objectives at once. In machines, multiobjective RL can be achieved by dividing a single agent into multipl...

Descripción completa

Guardado en:

Detalles Bibliográficos
Autores principales:	Don Murdoch, Ruidong Chen, Jesse H. Goldberg
Formato:	article
Lenguaje:	EN
Publicado:	Nature Portfolio 2018
Materias:	Medicine R Science Q
Acceso en línea:	https://doaj.org/article/c5992da32d684df8b89acf3c6d7bc3a3
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

id	oai:doaj.org-article:c5992da32d684df8b89acf3c6d7bc3a3
record_format	dspace
spelling	oai:doaj.org-article:c5992da32d684df8b89acf3c6d7bc3a32021-12-02T12:31:57ZPlace preference and vocal learning rely on distinct reinforcers in songbirds10.1038/s41598-018-25112-52045-2322https://doaj.org/article/c5992da32d684df8b89acf3c6d7bc3a32018-04-01T00:00:00Zhttps://doi.org/10.1038/s41598-018-25112-5https://doaj.org/toc/2045-2322Abstract In reinforcement learning (RL) agents are typically tasked with maximizing a single objective function such as reward. But it remains poorly understood how agents might pursue distinct objectives at once. In machines, multiobjective RL can be achieved by dividing a single agent into multiple sub-agents, each of which is shaped by agent-specific reinforcement, but it remains unknown if animals adopt this strategy. Here we use songbirds to test if navigation and singing, two behaviors with distinct objectives, can be differentially reinforced. We demonstrate that strobe flashes aversively condition place preference but not song syllables. Brief noise bursts aversively condition song syllables but positively reinforce place preference. Thus distinct behavior-generating systems, or agencies, within a single animal can be shaped by correspondingly distinct reinforcement signals. Our findings suggest that spatially segregated vocal circuits can solve a credit assignment problem associated with multiobjective learning.Don MurdochRuidong ChenJesse H. GoldbergNature PortfolioarticleMedicineRScienceQENScientific Reports, Vol 8, Iss 1, Pp 1-9 (2018)
institution	DOAJ
collection	DOAJ
language	EN
topic	Medicine R Science Q
spellingShingle	Medicine R Science Q Don Murdoch Ruidong Chen Jesse H. Goldberg Place preference and vocal learning rely on distinct reinforcers in songbirds
description	Abstract In reinforcement learning (RL) agents are typically tasked with maximizing a single objective function such as reward. But it remains poorly understood how agents might pursue distinct objectives at once. In machines, multiobjective RL can be achieved by dividing a single agent into multiple sub-agents, each of which is shaped by agent-specific reinforcement, but it remains unknown if animals adopt this strategy. Here we use songbirds to test if navigation and singing, two behaviors with distinct objectives, can be differentially reinforced. We demonstrate that strobe flashes aversively condition place preference but not song syllables. Brief noise bursts aversively condition song syllables but positively reinforce place preference. Thus distinct behavior-generating systems, or agencies, within a single animal can be shaped by correspondingly distinct reinforcement signals. Our findings suggest that spatially segregated vocal circuits can solve a credit assignment problem associated with multiobjective learning.
format	article
author	Don Murdoch Ruidong Chen Jesse H. Goldberg
author_facet	Don Murdoch Ruidong Chen Jesse H. Goldberg
author_sort	Don Murdoch
title	Place preference and vocal learning rely on distinct reinforcers in songbirds
title_short	Place preference and vocal learning rely on distinct reinforcers in songbirds
title_full	Place preference and vocal learning rely on distinct reinforcers in songbirds
title_fullStr	Place preference and vocal learning rely on distinct reinforcers in songbirds
title_full_unstemmed	Place preference and vocal learning rely on distinct reinforcers in songbirds
title_sort	place preference and vocal learning rely on distinct reinforcers in songbirds
publisher	Nature Portfolio
publishDate	2018
url	https://doaj.org/article/c5992da32d684df8b89acf3c6d7bc3a3
work_keys_str_mv	AT donmurdoch placepreferenceandvocallearningrelyondistinctreinforcersinsongbirds AT ruidongchen placepreferenceandvocallearningrelyondistinctreinforcersinsongbirds AT jessehgoldberg placepreferenceandvocallearningrelyondistinctreinforcersinsongbirds
_version_	1718394241027145728

Place preference and vocal learning rely on distinct reinforcers in songbirds

Ejemplares similares