{"id":184,"date":"2024-09-17T14:00:14","date_gmt":"2024-09-17T14:00:14","guid":{"rendered":"https:\/\/www.colips.org\/conferences\/nysf2024\/wp\/?page_id=184"},"modified":"2025-10-22T07:36:11","modified_gmt":"2025-10-22T07:36:11","slug":"keynote","status":"publish","type":"page","link":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/keynote\/","title":{"rendered":"Keynote"},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-page\" data-elementor-id=\"184\" class=\"elementor elementor-184\">\n\t\t\t\t<div class=\"elementor-element elementor-element-dc38ecf e-flex e-con-boxed e-con e-parent\" data-id=\"dc38ecf\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t<div class=\"elementor-element elementor-element-e6f56c6 e-con-full e-flex e-con e-child\" data-id=\"e6f56c6\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-64d4cba elementor-widget elementor-widget-image\" data-id=\"64d4cba\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img fetchpriority=\"high\" decoding=\"async\" width=\"734\" height=\"1013\" src=\"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/wp-content\/uploads\/2025\/10\/Lei-Xie-742x1024.jpg\" class=\"attachment-large size-large wp-image-368\" alt=\"\" srcset=\"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/wp-content\/uploads\/2025\/10\/Lei-Xie-742x1024.jpg 742w, https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/wp-content\/uploads\/2025\/10\/Lei-Xie-217x300.jpg 217w, https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/wp-content\/uploads\/2025\/10\/Lei-Xie-768x1060.jpg 768w, https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/wp-content\/uploads\/2025\/10\/Lei-Xie-1113x1536.jpg 1113w, https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/wp-content\/uploads\/2025\/10\/Lei-Xie-1170x1615.jpg 1170w, https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/wp-content\/uploads\/2025\/10\/Lei-Xie.jpg 1200w\" sizes=\"(max-width: 734px) 100vw, 734px\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t<div class=\"elementor-element elementor-element-b29dd31 e-con-full e-flex e-con e-child\" data-id=\"b29dd31\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-3fede75 elementor-widget elementor-widget-text-editor\" data-id=\"3fede75\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<ul style=\"font-size: 1.1em; font-weight: bold; margin: 0;\">Exploration of Song Generation and Evaluation Frameworks<\/ul>\n<div style=\"font-size: 1em; font-weight: bold; margin: 10px 0 5px 0;\">Abstract<\/div>\n<div style=\"text-align: justify;\">Music, as a fundamental component of human culture, embodies emotional expression and creative innovation. From classical to contemporary periods, its forms and modes of creation have continually evolved. In recent years, the rapid advancement of artificial intelligence has profoundly transformed the field of music generation, with generative models offering powerful capabilities for the automatic composition of high-quality musical works. This talk focuses on song generation and evaluation, examining methodologies for music creation and aesthetic assessment based on generative modeling. We first introduce an end-to-end song generation framework &#8212; DiffRhythm that integrates melody, lyrics, and vocal synthesis within a unified architecture. Subsequently, we present the construction of a music aesthetics evaluation dataset&#8211;SongEval, which provides a reliable foundation for assessing the artistic quality and perceptual appeal of generated songs. Building upon these components, we further propose an enhanced generation framework DiffRhythm+ that refines musicality and expressive creativity through improved model design and evaluation feedback mechanisms. Through this talk, the report highlights recent explorations of generative modeling and evaluative technologies in the domain of music generation, aiming to contribute new perspectives and methodological insights for future research in automatic song generation and assessment.<\/div>\n&nbsp;\n<div style=\"font-size: 1em; font-weight: bold; margin: 0 0 5px 0;\">Biography<\/div>\n<div style=\"text-align: justify;\">Lei Xie is a Professor at the School of Computer Science, Northwestern Polytechnical University (NPU), Xi\u2019an, China, where he leads the Audio, Speech and Language Processing Laboratory (ASLP@NPU). Prior to joining NPU, he held research positions at Vrije Universiteit Brussel (VUB), City University of Hong Kong, and The Chinese University of Hong Kong. Professor Xie has authored more than 400 peer-reviewed papers in leading journals and conferences on speech and audio processing, which have collectively received over 15,000 citations according to Google Scholar. In 2024, he was recognized on the Stanford University and Elsevier list of the world\u2019s most highly cited scientists. His current research interests span a broad range of topics in speech and language processing, multimedia, and human\u2013computer interaction. He serves as a Senior Area Editor for IEEE\/ACM Transactions on Audio, Speech, and Language Processing and IEEE Signal Processing Letters. He is also the Vice Chair of the ISCA Special Interest Group on Chinese Spoken Language Processing (ISCA-CSLP) and has previously served as a member of the IEEE Speech and Language Technical Committee (SLTC).<\/div>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"<p>Exploration of Song Generation and Evaluation Frameworks Abstract Music, as a fundamental component of human culture, embodies emotional expression and creative innovation. From classical to contemporary periods, its forms and modes of creation have continually evolved. In recent years, the&#8230;<br \/><a class=\"read-more-button\" href=\"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/keynote\/\">Read more<\/a><\/p>\n","protected":false},"author":3,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-184","page","type-page","status-publish","hentry"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.6 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Keynote - Nanyang Speech Technology Forum 2025<\/title>\n<meta name=\"robots\" content=\"noindex, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Keynote - Nanyang Speech Technology Forum 2025\" \/>\n<meta property=\"og:description\" content=\"Exploration of Song Generation and Evaluation Frameworks Abstract Music, as a fundamental component of human culture, embodies emotional expression and creative innovation. From classical to contemporary periods, its forms and modes of creation have continually evolved. In recent years, the...Read more\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/keynote\/\" \/>\n<meta property=\"og:site_name\" content=\"Nanyang Speech Technology Forum 2025\" \/>\n<meta property=\"article:modified_time\" content=\"2025-10-22T07:36:11+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/wp-content\/uploads\/2025\/10\/Lei-Xie.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1200\" \/>\n\t<meta property=\"og:image:height\" content=\"1656\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/keynote\/\",\"url\":\"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/keynote\/\",\"name\":\"Keynote - Nanyang Speech Technology Forum 2025\",\"isPartOf\":{\"@id\":\"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/keynote\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/keynote\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/wp-content\/uploads\/2025\/10\/Lei-Xie-742x1024.jpg\",\"datePublished\":\"2024-09-17T14:00:14+00:00\",\"dateModified\":\"2025-10-22T07:36:11+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/keynote\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/keynote\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/keynote\/#primaryimage\",\"url\":\"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/wp-content\/uploads\/2025\/10\/Lei-Xie.jpg\",\"contentUrl\":\"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/wp-content\/uploads\/2025\/10\/Lei-Xie.jpg\",\"width\":1200,\"height\":1656},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/keynote\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Keynote\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/#website\",\"url\":\"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/\",\"name\":\"Nanyang Speech Technology Forum 2025\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Keynote - Nanyang Speech Technology Forum 2025","robots":{"index":"noindex","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"og_locale":"en_US","og_type":"article","og_title":"Keynote - Nanyang Speech Technology Forum 2025","og_description":"Exploration of Song Generation and Evaluation Frameworks Abstract Music, as a fundamental component of human culture, embodies emotional expression and creative innovation. From classical to contemporary periods, its forms and modes of creation have continually evolved. In recent years, the...Read more","og_url":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/keynote\/","og_site_name":"Nanyang Speech Technology Forum 2025","article_modified_time":"2025-10-22T07:36:11+00:00","og_image":[{"width":1200,"height":1656,"url":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/wp-content\/uploads\/2025\/10\/Lei-Xie.jpg","type":"image\/jpeg"}],"twitter_card":"summary_large_image","twitter_misc":{"Est. reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/keynote\/","url":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/keynote\/","name":"Keynote - Nanyang Speech Technology Forum 2025","isPartOf":{"@id":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/keynote\/#primaryimage"},"image":{"@id":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/keynote\/#primaryimage"},"thumbnailUrl":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/wp-content\/uploads\/2025\/10\/Lei-Xie-742x1024.jpg","datePublished":"2024-09-17T14:00:14+00:00","dateModified":"2025-10-22T07:36:11+00:00","breadcrumb":{"@id":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/keynote\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/keynote\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/keynote\/#primaryimage","url":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/wp-content\/uploads\/2025\/10\/Lei-Xie.jpg","contentUrl":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/wp-content\/uploads\/2025\/10\/Lei-Xie.jpg","width":1200,"height":1656},{"@type":"BreadcrumbList","@id":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/keynote\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/"},{"@type":"ListItem","position":2,"name":"Keynote"}]},{"@type":"WebSite","@id":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/#website","url":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/","name":"Nanyang Speech Technology Forum 2025","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"}]}},"_links":{"self":[{"href":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/wp-json\/wp\/v2\/pages\/184","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/wp-json\/wp\/v2\/comments?post=184"}],"version-history":[{"count":77,"href":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/wp-json\/wp\/v2\/pages\/184\/revisions"}],"predecessor-version":[{"id":476,"href":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/wp-json\/wp\/v2\/pages\/184\/revisions\/476"}],"wp:attachment":[{"href":"https:\/\/www.colips.org\/conferences\/nysf2025\/wp\/index.php\/wp-json\/wp\/v2\/media?parent=184"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}