<?xml version="1.0" encoding="utf-8"?>
<?xml-stylesheet type="text/xsl" href="style/detail_T.xsl"?>
<bibitem type="C">   <ARLID>0380442</ARLID> <utime>20240103201204.9</utime><mtime>20121031235959.9</mtime>         <title language="eng" primary="1">Reducing Instruction Issue Overheads in Application-Specific Vector Processors</title>  <specification> <page_count>8 s.</page_count> <media_type>C</media_type> </specification>   <serial><ARLID>cav_un_epca*0380441</ARLID><ISBN>978-0-7695-4798-5</ISBN><title>Proceedings of the 15th Euromicro Conference on Digital System Design, DSD 2012</title><part_num/><part_title/><page_num>600-607</page_num><publisher><place>Izmir</place><name>Conference Publishing Services</name><year>2012</year></publisher><editor><name1>Niar</name1><name2>Smail</name2></editor></serial>    <keyword>custom accelerators</keyword>   <keyword>vector processing</keyword>   <keyword>FPGA</keyword>   <keyword>DSP</keyword>    <author primary="1"> <ARLID>cav_un_auth*0261496</ARLID> <name1>Sýkora</name1> <name2>Jaroslav</name2> <full_dept language="cz">Zpracování signálů</full_dept> <full_dept language="eng">Department of Signal Processing</full_dept> <department language="cz">ZS</department> <department language="eng">ZS</department> <institution>UTIA-B</institution> <garant>G</garant>  <fullinstit>Ústav teorie informace a automatizace AV ČR, v. v. i.</fullinstit> </author> <author primary="0"> <ARLID>cav_un_auth*0202591</ARLID> <name1>Bartosinski</name1> <name2>Roman</name2> <full_dept language="cz">Zpracování signálů</full_dept> <full_dept>Department of Signal Processing</full_dept> <department language="cz">ZS</department> <department>ZS</department> <institution>UTIA-B</institution>  <fullinstit>Ústav teorie informace a automatizace AV ČR, v. v. i.</fullinstit> </author> <author primary="0"> <ARLID>cav_un_auth*0225749</ARLID> <name1>Kohout</name1> <name2>Lukáš</name2> <full_dept language="cz">Zpracování signálů</full_dept> <full_dept>Department of Signal Processing</full_dept> <department language="cz">ZS</department> <department>ZS</department> <institution>UTIA-B</institution> <full_dept>Department of Signal Processing</full_dept>  <fullinstit>Ústav teorie informace a automatizace AV ČR, v. v. i.</fullinstit> </author> <author primary="0"> <ARLID>cav_un_auth*0101077</ARLID> <name1>Daněk</name1> <name2>Martin</name2> <full_dept language="cz">Zpracování signálů</full_dept> <full_dept>Department of Signal Processing</full_dept> <department language="cz">ZS</department> <department>ZS</department> <institution>UTIA-B</institution>  <fullinstit>Ústav teorie informace a automatizace AV ČR, v. v. i.</fullinstit> </author> <author primary="0"> <ARLID>cav_un_auth*0233557</ARLID> <name1>Honzík</name1> <name2>P.</name2> <country>CZ</country>  </author>   <source> <url>http://library.utia.cas.cz/separaty/2012/ZS/sykora-reducing instruction issue overheads in application-specific vector processors.pdf</url> </source>        <cas_special> <project> <project_id>7H10001</project_id> <agency>GA MŠk</agency> <country>CZ</country> <ARLID>cav_un_auth*0272880</ARLID> </project> <project> <project_id>Artemis JU 100230</project_id> <agency>Commission EU</agency> <country>XE</country> <ARLID>cav_un_auth*0283284</ARLID> </project>  <abstract language="eng" primary="1">The traditional approach to IP core design is to use simulations with test vectors. This is not feasible when dealing with complex function cores such as the Image Segmentation case-study algorithm in this paper. An algorithm developer needs to carry out experiments on large real-world data sets, with fast turn-around times, and in real time to facilitate performance tuning and incremental development. Previously we proposed a methodology called Application-Specific Vector Processor (ASVP). The ASVP approach first constructs a programmable architecture customized for a given application, then employs software techniques to develop firmware that implements the algorithm. In our setting we employ an embedded simple scalar CPU (8-bit PicoBlaze 3) to control a floating-point vector processing unit (VPU) by issuing wide (horizontally encoded) instructions to it. In this work we dramatically reduce the overhead of the wide-instruction issue (in one case by 13x) by implementing a new two-level configuration table.</abstract>  <action target="WRD"> <ARLID>cav_un_auth*0283283</ARLID> <name>15th Euromicro Conference on Digital System Design</name> <place>Cesme</place> <dates>05.09.2012-08.09.2012</dates>  <country>TR</country> </action>    <reportyear>2013</reportyear>  <RIV>JC</RIV>      <num_of_auth>5</num_of_auth>  <presentation_type> PR </presentation_type>  <permalink>http://hdl.handle.net/11104/0211153</permalink>        <arlyear>2012</arlyear>       <unknown tag="mrcbU63"> cav_un_epca*0380441 Proceedings of the 15th Euromicro Conference on Digital System Design, DSD 2012 978-0-7695-4798-5 600 607 Izmir Conference Publishing Services 2012 </unknown> <unknown tag="mrcbU67"> Niar Smail 340 </unknown> </cas_special> </bibitem>