Create 2025 Baidu AI Developer Conference
Meeting Summary
Baidu founder Robin Li (Li Yanhong) stressed the rapid pace of AI development at the Intelligent Cloud Developer Ecosystem Conference and introduced the Wenxin large models 4.5 Turbo and X1 Turbo, which are multimodal, strong at reasoning, and low in cost. The conference showcased highly persuasive digital humans and one-click cloning, upgraded AI features in Baidu Wenku and Baidu Netdisk, autonomous driving technology, and the no-code programming tool Miaoda. Baidu is promoting AI innovation through the MCP ecosystem: the Qianfan platform is fully compatible with MCP and supports developers in creating and publishing MCP servers. Baidu also lit up a 30,000-card cluster built on the Kunlun chip P800, described the chip's benefits in finance, intelligent customer service, and multimodal data analysis, and released Kunlun chip super-node technology, which greatly improves single-node performance and the efficiency of computing power management.
Meeting Overview
Application value and developer opportunities in the era of large models
With AI technology advancing rapidly and large models iterating frequently, developers face the risk that existing applications quickly become outdated. The same progress, however, gives developers more choices and possibilities. By understanding technology trends and avoiding products that sit directly on the path of large-model development, developers can use increasingly powerful models to build applications with practical value. Such applications will not be overshadowed by model capabilities; instead, they can reach more scenarios and become more valuable. For example, in traffic safety event detection, combining cloud-based large models with small models on edge systems has significantly improved detection accuracy and reduced the workload of monitoring personnel. Applications that deliver practical value like this are the real opportunity for developers, and such opportunities abound across industries. Baidu has already seen significant results from deploying large models, offering developers free calls, and applying the models in its internal business lines.
Baidu Intelligent Cloud releases the Wenxin large models 4.5 Turbo and X1 Turbo
Baidu Intelligent Cloud has released the Wenxin large models 4.5 Turbo and X1 Turbo, aiming to address current weaknesses of AI models in multimodal understanding, hallucination rate, cost, and speed. Wenxin 4.5 Turbo is a multimodal model that jointly understands text, speech, images, and video. Its API price is only 1% of that of GPT-4.5, and it performs well on image and video understanding. X1 Turbo strengthens deep thinking and logical reasoning as well as multimodal processing, and it can call different tools at only half the price of similar models. The release of these two versions reflects the trend of multimodality becoming standard for future foundation models.
Price and capabilities of the Wenxin large model X1 Turbo
This part covered the price, performance, and multi-tool invocation capabilities of the Wenxin large model X1 Turbo, demonstrated its advantages for AI application development through practical examples, and emphasized the importance of cost reduction in driving AI innovation.
Breakthrough and application of highly persuasive AI digital humans in 2025.
One important breakthrough in 2025 is AI digital human technology, especially highly persuasive digital humans, which can surpass real people through ultra-realistic voice and appearance, professional content, and flexible interaction. Such digital humans have huge application space in e-commerce, live streaming, and gaming. Unlike traditional digital humans, they are driven by rich multimodal scripts: they deliver vivid dialogue and also adjust expressions, tone, and movements in real time, with smooth transitions between emotions and actions. Their performance far exceeds that of conventional digital humans and can be difficult to distinguish from that of real people.
The application and technological breakthroughs of digital humans in e-commerce live streaming.
In e-commerce live streaming, digital human technology is widely used to build efficient live-streaming teams, so that one person can work like a whole marketing team. Scheduled in real time by an AI brain, digital humans can switch flexibly between roles such as host and co-host and promote sales according to the popularity and conversion rate of the live room. So that more people can own and monetize their own digital human, a one-click cloning function has been introduced that trains a reusable live-streaming digital human from just two minutes of video. Highly persuasive digital humans are a typical application of multimodal large models. Baidu Wenku, by combining multiple models, has built AI features with over 40 million paying users, demonstrating the practical effect of using models in combination.
Multimodal content generation and processing with Free Canvas
Free Canvas can handle materials of many modalities and file types, including Word, PDF, images, audio, and video. With AI search and image-to-image functions, users can research and create content efficiently. For example, when researching the Yangtze finless porpoise population, users can drag in or paste various materials, specify how each should be used, and generate the required content with one click, such as long articles, PPTs, or video picture books. This ability rests on the combined use of multiple models, including Wenxin fine-tuned models, multimodal models, proprietary models, and industry models, which together form the technical foundation called Cangzhou OS. Cangzhou OS has two core components, Chatfile Plus and the "three libraries and three tools", which vectorize content of different modalities and combine it to generate new content. On this foundation, Baidu Netdisk recently launched the AI Notes feature, further expanding the application scenarios of content generation and processing.
Baidu Netdisk product manager introduces the AI Notes feature
The product manager of Baidu Netdisk introduced AI Notes, a feature that uses AI to automatically generate detailed notes closely tied to the original study materials. AI Notes not only produces comprehensive knowledge content but also provides timestamp tracing, knowledge tables, mind maps, and test questions based on video content, supporting multimodal learning. Users can share their notes, and the feature has received positive feedback since launch. Baidu Wenku and Baidu Netdisk plan to keep introducing AI features, aiming to become efficient tools for learning and productivity.
Progress of large vision models in autonomous driving, and of intelligent agents.
This part discussed the application of large vision models in autonomous driving, emphasizing the safety and convenience of autonomous driving and its global rollout. It also introduced the rapid development of intelligent agents, especially code agents that improve programming efficiency and no-code programming tools, such as Baidu's Wenxin Kuaima (Comate) and Miaoda, showing how they help non-technical people complete tasks easily and enable innovative applications in education and entertainment.
Miaoda helps college student entrepreneurs and ordinary people realize their technology dreams.
Miaoda helped a college student entrepreneurial team build an integrated purchasing and delivery system in only a few minutes, without a professional development team, significantly lowering the technical threshold. After the requirement document is uploaded, the Miaoda agent automatically completes tasks such as page design and feature development, and integrates practical functions such as Baidu Intelligent Cloud storage, databases, and map navigation. Miaoda has also helped people without a technical background, such as Shaanxi fruit farmers, retired master craftsmen, and Suzhou embroiderers, build the applications they had in mind, illustrating a future in which ordinary people also have programming capabilities once the technology is widely available.
Xinxiang App: a future application of multi-agent collaboration.
Xinxiang App is a general-purpose super agent application that solves complex problems for users through autonomous planning and collaboration among multiple agents. It can provide detailed plans and solutions in everyday scenarios such as travel planning, children's video picture book creation, and legal consultation. Xinxiang also performs well in in-depth research, data analysis, and health consultation, integrating multiple tools and continuously improving the user experience. Xinxiang App is available in major Android app stores, and more beta features will be released in the future.
Connecting AI applications through the Baidu Search Open Platform and MCP.
Through its AI Open Plan, the Baidu Search Open Platform aims to connect users quickly with all kinds of AI applications, including agents, H5 pages, mini programs, and standalone apps, giving users the latest and most complete AI services while offering developers traffic and revenue opportunities. Examples such as the integration of a 3D home design AI application showed how users can find and use AI services directly through search. In addition, Baidu Search will comprehensively index the MCP servers on the market and provide developers with a full set of development tools, underlining the importance of MCP in AI application development.
The Model Context Protocol (MCP) promotes innovation and development of AI applications.
With the explosion of AI applications, developers face challenges such as tool standardization, platform adaptation, and component integration. The Model Context Protocol (MCP) is like a universal socket for AI: it simplifies development and improves efficiency. Through MCP, developers can expose their resources, data, and capabilities in a standard way, and can also reuse existing MCP servers to reduce development work and extend what their applications can do. Technology companies in China and abroad, including Anthropic, OpenAI, Google, Alibaba, ByteDance, and Tencent, are actively embracing MCP. Baidu is helping developers make full use of MCP by optimizing the Wenxin large model, making the Qianfan platform compatible with MCP, and building an MCP server discovery platform, among other measures. Specific cases included Samsung phones accessing Baidu Wenku and Netdisk functions, agents combined with search, and MCP servers providing precise product recommendations and transaction services. Baidu will continue to promote the prosperity of the MCP ecosystem and empower developers to innovate.
Baidu's 3rd Wenxin Cup Entrepreneurship Competition officially kicks off.
Baidu has consistently supported developers with models, development tools, and funding, and promotes the prosperity of the large model ecosystem through the Wenxin Cup Entrepreneurship Competition. The first two competitions attracted over 2,500 teams worldwide, and Baidu has provided over 200 million yuan in financial support and comprehensive assistance to the winning teams. Nearly half of the winning teams have gone on to receive next-round funding, a positive development trend. The third Wenxin Cup Entrepreneurship Competition has officially started with increased support for entrepreneurs: the prize for single projects will double, and the investment for special prizes can reach up to 70 million yuan.
Baidu announces plans to train 10 million AI professionals in the next five years and launch a fully self-developed cluster of 30,000 cards.
Baidu promised to step up its training of AI talent over the next five years, aiming to cultivate 10 million AI professionals. It also introduced the first domestically developed 30,000-card cluster, built on the Kunlun chip P800, which can effectively support large-scale AI model training, significantly increase chip utilization, and reduce energy consumption. In addition, Baidu released a number of AI applications and technologies, including the Wenxin large models, highly persuasive digital humans, Cangzhou OS, the latest progress of the Miaoda agent tool, and the multi-agent collaboration app, aiming to give developers powerful tools and platforms and to promote the development of the intelligent economy.
The application of AI technology in cultural relics protection and intangible cultural heritage martial arts inheritance.
This part focused on the Baidu AI Developer Conference and gave a detailed introduction to AI technology, especially innovative applications of the Wenxin large model in various fields. It highlighted how AI can contribute to cultural relics protection and to the inheritance of martial arts as intangible cultural heritage. Working with the Wenxin large model makes possible a deeper understanding of cultural relics and their dissemination across time, as well as the systematic protection and inheritance of heritage martial arts. It also introduced code agents and the intelligent coding assistant Wenxin Kuaima in software development, showing how AI can lower the development threshold and improve development efficiency. Overall, it showcased the potential and practical effect of AI in promoting cultural inheritance and improving work efficiency.
The world's first humanoid robot to complete a half marathon and its technological innovations
With the support of Baidu Intelligent Cloud, the Beijing Robotics Innovation Center has developed the Tiangong Ultra robot. In a recent half marathon, the robot finished in 2 hours, 40 minutes, and 42 seconds, becoming the world's first humanoid robot to complete a half marathon. The achievement is not only a breakthrough in a sporting event but also an extreme test of the long-term stable operation of humanoid robots in real-world conditions. Building on the Tiangong hardware platform and a general-purpose intelligent software platform, the center is creating a globally influential embodied intelligence innovation base and an industrial base for application pilots. The Tiangong robot is fully electrically driven, stands 1.73 meters tall, weighs over 70 kilograms, and has 42 degrees of freedom. The upgraded Tiangong 2.0 has stronger AI capabilities and upper-limb manipulation, enabling complex task planning and human-machine interaction. The center has also released the world's first general-purpose embodied intelligence platform in which one brain serves multiple capabilities and multiple machines, and it has begun application trials in industrial, service, and educational scenarios.
Baidu Intelligent Cloud helps enterprises achieve business innovation through large models.
This part emphasized the key role of Baidu Intelligent Cloud in driving business innovation, especially in efficient application development and model fine-tuning with large models. Through the Qianfan platform, enterprises can access a range of advanced models, including the self-developed Wenxin models and third-party models, and can reduce costs and improve performance through model distillation. The Qianfan platform also offers both low-code and full-code development paradigms and an enterprise-level agent framework, so that enterprises can combine their own knowledge bases and industry experience to build custom agents for specific business needs. By supporting MCP, Baidu Intelligent Cloud further opens up the ecosystem and makes data and tools easier to use, encouraging developers to build and share their own MCP services and jointly create a stronger ecosystem. With the support of these technologies and platforms, enterprises such as China Minsheng Bank have put industry innovation into practice.
Exploration and Practice of Construction and Application of AI Large Model in China Minsheng Bank
China Minsheng Bank shared its progress and experience in building and applying large AI models. The bank has made the large-scale application of AI a strategic initiative, with an approach of embracing the technology actively while advancing steadily, and it focuses both on building systematic capabilities and on landing specific scenarios. Through close cooperation with partners such as Baidu, Minsheng Bank has built a technical support system for large models and implemented more than 40 applications covering over 140 specific scenarios. It particularly emphasized the importance of fine-tuning in model applications and the decisive role that a complete knowledge system plays in realizing the value of large models. Minsheng Bank is also drawing up its first AI strategy, which stresses talent, technology as a driver, architectural transformation, and AI governance, aiming to move from digitization to intelligence while ensuring the security and ethical compliance of AI applications.
Innovation practice of intelligent customer service and visual AI applications
This session introduced two products in detail: the intelligent customer service product "Keyue" and the visual AI application "Yijian". Built on a high-EQ "gold-medal sales" model, "Keyue" reaches target audiences precisely, hands over smoothly to human agents, and supports customer segmentation, reactivation of existing customers, and customer win-back. The demonstration showed "Keyue" helping a customer resolve a refund issue with a financial product and recommending financial products that fit the customer's needs, illustrating its ability to understand and meet user needs. "Yijian" lowers the threshold for developing visual scenarios through cloud-edge collaboration, allowing business staff to take part in development and significantly reducing implementation costs. A practical example showed "Yijian" detecting in real time whether hamburger preparation in a restaurant chain's kitchen meets standards, helping to prevent customer complaints. These products demonstrate the application of intelligent infrastructure and its broad applicability and efficiency in industries such as finance and catering.
AI Developer Conference: From Computing Power Models to Building Intelligent Infrastructure
The AI Developer Conference walked through the complete process of AI application development: selecting appropriate scenarios, using the capabilities of large models to build products, and paying attention to inference costs and model fine-tuning when scaling up. Through cooperation cases with enterprises, such as the intelligent upgrade project of China Iron and Steel Research Institute, it showed how a company can build its own AI infrastructure and promote the intelligent transformation of its industry. The conference emphasized open systems and encouraged innovation, and more progress was shared at the afternoon Developer Ecosystem Conference, with the aim of fully empowering customers and partners so that AI can truly improve production efficiency and create a better life.
Key Q&A
Q:With AI technology developing rapidly, should developers worry that applications built on large models will quickly be overtaken by newer model iterations and lose their value?
A:This is a double-edged sword. On one hand, developers do need to follow technology trends and avoid building applications that sit directly on the path of large-model development; on the other hand, stronger models give developers more choices and possibilities. As long as developers identify the right application scenario, select an appropriate foundation model, and learn how to optimize it, the resulting application will not become outdated, because the application itself is what creates value.
Q:How can large models improve the detection performance of small models in edge systems?
A:Large models deployed in the cloud can verify, within seconds, video clips that small models on the edge side have flagged, raising recognition accuracy to over 95% for common events and over 90% for long-tail events. This significantly reduces the number of cases in which monitoring personnel need to intervene.
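The edge-then-cloud verification described in this answer can be sketched as a two-stage cascade. This is a minimal illustration, not Baidu's implementation: the `Clip` type, the `cascade` function, the 0.3 threshold, and the stand-in `fake_cloud_verify` are all hypothetical names and values chosen for the example.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Clip:
    clip_id: str
    edge_score: float  # confidence from the small edge model, in [0, 1]


def cascade(clips: List[Clip],
            cloud_verify: Callable[[Clip], bool],
            edge_threshold: float = 0.3) -> List[str]:
    """Return ids of clips that the cloud model confirms.

    Only clips the edge model flags (score >= threshold) are sent to the cloud.
    """
    confirmed = []
    for clip in clips:
        if clip.edge_score < edge_threshold:
            continue  # not flagged at the edge, so it is never uploaded
        if cloud_verify(clip):
            confirmed.append(clip.clip_id)
    return confirmed


# Hypothetical stand-in for the cloud model: records which clips it was asked about.
cloud_calls: List[str] = []


def fake_cloud_verify(clip: Clip) -> bool:
    cloud_calls.append(clip.clip_id)
    return clip.clip_id != "b"  # pretend clip "b" is a false alarm


clips = [Clip("a", 0.1), Clip("b", 0.5), Clip("c", 0.9)]
alerts = cascade(clips, fake_cloud_verify)
print(alerts)       # clips that would reach a human operator
print(cloud_calls)  # clips that were uploaded for verification
```

In this arrangement the cheap edge model filters most footage locally, and the cloud model removes false alarms before anything reaches monitoring personnel.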
Q:Can combining large models with specific application scenarios create practical value, and could you provide examples?
A:As model capabilities improve, there are more opportunities to combine large models with application scenarios. For example, in the transportation industry, large models can help improve the accuracy of highway safety event detection. Such applications will not be overshadowed by model capabilities; they can reach more scenarios and create higher value.
Q:How is DeepSeek being used on the Baidu Intelligent Cloud Qianfan platform?
A:DeepSeek has been deployed on the Baidu Intelligent Cloud Qianfan platform, which has given tens of thousands of developers free access. Baidu Search, Baidu Maps, and other business lines have integrated the full version of DeepSeek, with very good results.
Q:What improvements do Wenxin 4.5 and X1 offer compared with previous models?
A:Wenxin 4.5 is the first native multimodal large model that integrates text, speech, image, and video understanding. It outperforms GPT-4.5 in multiple tests at a lower cost. Wenxin X1 is a deep thinking model that performs on par with DeepSeek-R1 but at a lower cost. Together they emphasize multimodality, strong reasoning, and low cost.
Q:How do Wenxin 4.5 Turbo and X1 Turbo perform in understanding and processing multimodal content?
A:Wenxin 4.5 Turbo has made significant progress in understanding images and videos: it can accurately identify World Cup match segments and sink experiment scenes in blurry images, and it has also improved in logical reasoning and coding. Wenxin X1 Turbo not only has more advanced chain-of-thought and deep thinking abilities but can also call different tools, such as internet search and AI drawing, to complete complex tasks, such as creating high-quality Garlic Bird images and writing resumes.
Q:Why does Baidu continue to lower the price of large models?
A:The goal of reducing cost is to remove the barrier that keeps developers from applying large models widely. As costs fall, developers and entrepreneurs can innovate with more confidence and companies can deploy large models cheaply, which in turn drives an explosion of applications across industries.
Q:What are the application scenarios of highly persuasive digital humans in e-commerce, live streaming, and other fields?
A:Highly persuasive digital humans possess characteristics such as ultra-realistic voice and appearance, professional content output, and flexible interaction, with huge application space in e-commerce, live streaming, and other fields. They are driven by rich multimodal scripts, surpassing the expressiveness of real humans, able to flexibly coordinate multiple roles based on real-time scenarios, achieving immersive experiences and efficient marketing conversions.
Q:Can e-commerce live streamers be replicated as digital humans, and how does one-click cloning work?
A:We have launched a one-click cloning feature: users only need to record a live-streaming video of at least two minutes and upload it to the Baidu Huiboxing platform for basic training. After that, they can reuse this digital human for live streaming as often as they like, making it possible for everyone to become a live streamer.
Q:What has Baidu Wenku achieved in applying multimodal large models?
A:Baidu Wenku's AI features have over 40 million paying users and 97 million monthly active users. Among them, Free Canvas is a typical example of using multiple models in combination: it can handle many file types and combines different models according to user needs to process materials and generate content.
Q:How does Free Canvas help users turn materials into content efficiently?
A:Free Canvas lets users drag in various materials from Baidu Netdisk, such as Word, PDF, images, audio, and video. Users can also paste web links or supplement content through AI search, and specify how each material should be used. For example, users can select a research paper on Yangtze River dolphins and a policy analysis paper, quote key passages and integrate the content, and then generate output in the required format with one click, such as long articles, PPTs, or picture books.
Q:What is the technical foundation of Baidu Wenku, and what are its core components?
A:The technical foundation of Baidu Wenku is called Cangzhou OS. It includes Chatfile Plus, which parses and vectorizes content of different modalities, and the "three libraries and three tools": a public-domain knowledge base, a private-domain knowledge base, and a memory base, plus an editor, a reader, and a player. These capabilities combine multiple models to process audio, video, and other materials, and have been consolidated into a solid technical foundation.
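To make the idea of vectorizing content and drawing on three separate libraries concrete, here is a generic retrieval sketch. It is not Cangzhou OS code: the library contents, the three-dimensional toy vectors, and the `cosine` and `search` helpers are all assumptions made for illustration, and a real system would obtain vectors from an embedding model.

```python
import math
from typing import Dict, List, Tuple

Vector = List[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Hypothetical contents of the three libraries, with made-up toy vectors.
libraries: Dict[str, List[Tuple[str, Vector]]] = {
    "public": [("encyclopedia entry on finless porpoises", [0.9, 0.1, 0.0])],
    "private": [("uploaded policy analysis PDF", [0.7, 0.6, 0.1])],
    "memory": [("earlier request about slide layout", [0.0, 0.2, 0.9])],
}


def search(query: Vector,
           libs: Dict[str, List[Tuple[str, Vector]]],
           top_k: int = 2) -> List[Tuple[str, str]]:
    """Rank items from all libraries by similarity to the query vector."""
    scored = [
        (cosine(query, vec), name, text)
        for name, items in libs.items()
        for text, vec in items
    ]
    scored.sort(key=lambda item: item[0], reverse=True)
    return [(name, text) for _, name, text in scored[:top_k]]


results = search([1.0, 0.2, 0.0], libraries)
for name, text in results:
    print(name, "->", text)
```

Once every modality is mapped into the same vector space, a single query can pull relevant material from public, private, and memory stores at the same time, which is the property the answer above relies on.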
Q:How does Baidu Netdisk's AI Notes work, and how does it meet user needs?
A:AI Notes uses Baidu's AI technology to combine video content with text notes, generating comprehensive notes with rich multimedia elements and timestamp tracing. It can also organize knowledge points into tables or mind maps and let users practice test questions to consolidate what they have learned. The demonstration showed convenient note-taking while watching a video, review with explanations, and generation of comprehensive knowledge tables, combining the advantages of traditional notes with the demand for multimodal learning.
Q:When you used the beta version of the Xinxiang App, what kind of itinerary did it plan for you?
A:When I first arrived in Wuhan, I asked the beta version of the Xinxiang App to plan a relaxing afternoon itinerary for me. Based on my preferences (traveling solo, with no companion, and hoping to combine natural scenery with local culture), it generated a detailed plan, marked the route on the map, recommended the nearby Tanghu Park and restaurants, automatically collected discount information from across the web, and helped me make reservations. Finally, it reminded me to get ready for the return journey at 5 p.m. and booked a taxi for me.
Q:You mentioned running into a problem during a business trip. How did Xinxiang help?
A:During my business trip, my landlord refused to return my deposit, citing issues with the property. I described the situation to the legal expert in Xinxiang, which quickly analyzed the case and clarified the key points. It found three lawyers specializing in civil law to provide professional advice, organized the legal basis and documents needed to protect my rights, and generated a comprehensive legal analysis report with suggested steps for protecting my rights.
Q:How does Baidu's AI Open Plan work on the Baidu Search Open Platform?
A:Baidu's AI Open Plan on Baidu Search aims to embrace various AI applications, establish diverse distribution mechanisms, provide users with the latest and most comprehensive AI services, and also provide traffic and revenue for developers. For example, when a user enters a specific requirement in the search box (such as 3D home design), Baidu Search will display relevant AI application cards, allowing users to customize exclusive solutions and generate visual effects with just one click. Currently, there are applications in multiple fields such as AI interview assistant, professional medical consultation, and visual content production integrated into the platform.
Q:What role does the Model Context Protocol (MCP) play in AI development?
A:MCP provides AI developers with a unified standard and efficient way to solve problems such as lack of standards, low development efficiency, and difficulty in integration and maintenance when using tools. With MCP, developers only need to write the interface once according to the standard, and they can easily call various tool resources, greatly reducing the development burden. This makes it easier for AI to access information and freely call other tools, promoting the progress of AI development.
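The "write the interface once" idea can be illustrated with a toy tool server. MCP messages follow JSON-RPC 2.0, and tools are discovered and invoked through the `tools/list` and `tools/call` methods; everything else below is a simplified sketch, not the official MCP SDK or any Baidu API. The `ToyToolServer` class and the `add` tool are hypothetical, and a real server would also publish an input schema for each tool and run over a transport such as stdio or HTTP.

```python
from typing import Any, Callable, Dict


class ToyToolServer:
    """In-process stand-in for an MCP-style server (illustration only)."""

    def __init__(self) -> None:
        self._tools: Dict[str, Dict[str, Any]] = {}

    def tool(self, name: str, description: str) -> Callable:
        """Decorator that registers a plain function as a callable tool."""
        def decorator(fn: Callable) -> Callable:
            self._tools[name] = {"fn": fn, "description": description}
            return fn
        return decorator

    def handle(self, request: Dict[str, Any]) -> Dict[str, Any]:
        """Dispatch one JSON-RPC 2.0 request and build the response."""
        base = {"jsonrpc": "2.0", "id": request.get("id")}
        method = request.get("method")
        params = request.get("params", {})
        if method == "tools/list":
            tools = [{"name": n, "description": t["description"]}
                     for n, t in self._tools.items()]
            return {**base, "result": {"tools": tools}}
        if method == "tools/call":
            entry = self._tools.get(params.get("name"))
            if entry is None:
                # -32602 is the JSON-RPC "invalid params" code
                return {**base, "error": {"code": -32602, "message": "unknown tool"}}
            value = entry["fn"](**params.get("arguments", {}))
            return {**base, "result": {"content": [{"type": "text", "text": str(value)}]}}
        # -32601 is the JSON-RPC "method not found" code
        return {**base, "error": {"code": -32601, "message": "method not found"}}


server = ToyToolServer()


@server.tool("add", "Add two integers")  # hypothetical example tool
def add(a: int, b: int) -> int:
    return a + b


listing = server.handle({"jsonrpc": "2.0", "id": 1, "method": "tools/list"})
response = server.handle({
    "jsonrpc": "2.0", "id": 2, "method": "tools/call",
    "params": {"name": "add", "arguments": {"a": 2, "b": 3}},
})
print(listing["result"]["tools"])
print(response["result"]["content"][0]["text"])
```

Because every tool is listed and called through the same two methods, a client that speaks the protocol can use any conforming server without tool-specific glue code.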
Q:What measures has Baidu taken to support the Model Context Protocol (MCP)?
A:Baidu has optimized the Wenxin foundation model to improve its task planning and scheduling when using MCP, and has made the Qianfan platform fully compatible with MCP so that developers can create and publish MCP servers on it. Baidu Search has built an MCP server discovery platform that indexes high-quality MCP servers across the web, and Baidu has launched Wenxin Kuaima (Comate) as the first intelligent coding assistant to support MCP servers. In addition, Baidu has opened up MCP servers for many of its own applications and services, such as Baidu Netdisk, Baidu product search and transactions, and Baidu Wenku, for developers to call, empowering developers to innovate and jointly build a prosperous MCP ecosystem.
Q:What is new in the third Wenxin Cup Entrepreneurship Competition?
A:The third Wenxin Cup Entrepreneurship Competition has officially launched. We will increase our support for entrepreneurs in this competition, doubling the prize money for single projects, with the investment for special grand prizes reaching up to 70 million RMB.
Q:What commitments has Baidu made on AI talent cultivation, and what challenges does it face in AI research and development?
A:Baidu is a technology company that has always pursued innovation and is dedicated to training AI talent. We previously proposed training 5 million AI professionals for society, and 6.3 million have been trained to date. Over the next five years, we will step up our efforts and train another 10 million to support the development of the intelligent economy. Building a 30,000-card cluster poses challenges at every level from hardware to software. Baidu addresses them with the Baidu heterogeneous computing platform, which keeps the cluster stable through an ultra-large-scale, high-performance network and uses innovative energy-saving designs, among other measures, giving China the confidence to develop AI applications.
Q:What support measures does Baidu provide for developers and entrepreneurs?
A:To help developers and entrepreneurs build AI applications at lower cost, Baidu has launched a major initiative: using the P800, the third-generation Kunlun chip, it has built the first domestically developed 30,000-card cluster, which can handle the training needs of large-scale AI models.
Q:What innovative applications and tools did Baidu release at the AI Developer Conference?
A:During the conference, Baidu released a series of products, including the more powerful and cost-effective Wenxin large models 4.5 Turbo and X1 Turbo, the highly persuasive digital human, the powerful Cangzhou OS, the latest progress of Intelligent Body Second Development, a multi-agent collaboration app, and the forward-looking Baidu Search AI Open Plan. Baidu also successfully lit up China's first self-developed 30,000-card cluster.
Q:How does AI help visually impaired individuals achieve their programming dreams?
A:One visually impaired developer used Baidu's coding assistant Wenxin Kuaima, which was adapted to work with screen readers, to write code smoothly. This allowed them to realize the dream of becoming a programmer, serve as their team's chief architect, and develop accessible applications. The case reflects the role AI technology can play in promoting inclusive employment and innovation.
Q:What capability expansions and technological innovations do the Baidu Wenxin large models 4.5 Turbo and X1 Turbo bring compared with their predecessors?
A:The Wenxin large models 4.5 Turbo and X1 Turbo further improve multimodal capabilities, with better performance and lower costs. Key technologies include multimodal heterogeneous expert modeling, a self-feedback enhancement framework, reinforcement learning that integrates preference learning, and a composite chain-of-thought model that combines deep thinking with tool invocation. Together, these technologies significantly improve the models' ability to understand and handle complex tasks.
Q:How can Wenxin Kuaima help programmers improve the efficiency and quality of their programming?
A:The intelligent coding assistant Wenxin Kuaima version 3.5 significantly improves the development experience through breakthroughs in four core capabilities. First, the code agent engine supports multimodal programming, development tools, and application previews, covering the end-to-end flow from requirements through coding and debugging to verification. Second, the code prediction and rewriting engine adds cursor prediction and multi-line intelligent rewriting, accurately handling additions, deletions, and modifications in complex code. Third, the context engine combines the reasoning abilities of Wenxin 4.5 and X1 to better understand the developer's intent. Finally, a more open development ecosystem is fully compatible with mainstream development toolchains through the MCP protocol.
Q:What technological innovations and performance improvements does the PaddlePaddle 3.0 framework deliver?
A:PaddlePaddle 3.0 makes breakthroughs in key deep learning framework technologies: unified automatic parallelism, which reduces the distributed training code needed for large models by 80%; integrated training and inference, which accelerates reinforcement learning training by 114%; high-order differential equation solving for scientific computing that is 115% faster than PyTorch; and neural network compiler optimization, which improves end-to-end model training speed by 27%. In addition, PaddlePaddle has been adapted to more than sixty series of chips at home and abroad, achieving software-hardware co-optimization and greatly improving development efficiency and model execution efficiency.
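The percentages in this answer are easier to compare once converted to ratios. The sketch below is illustrative arithmetic only, under a stated assumption: it reads "N% faster" as "N% more work per unit time", which the minutes do not confirm. The function names are hypothetical and are not part of PaddlePaddle.

```python
def speedup_from_percent_faster(percent: float) -> float:
    """Throughput ratio, ASSUMING 'N% faster' means N% more work per unit time."""
    return 1.0 + percent / 100.0


def time_saved_fraction(percent: float) -> float:
    """Fraction of wall-clock time saved on a fixed workload at that speedup."""
    return 1.0 - 1.0 / speedup_from_percent_faster(percent)


def remaining_after_reduction(percent: float) -> float:
    """Fraction left after an 'N% reduction' (for example, in lines of code)."""
    return 1.0 - percent / 100.0


# Figures quoted in the answer above; the interpretation is an assumption.
for label, pct in [
    ("reinforcement learning training", 114),
    ("high-order differential equation solving", 115),
    ("end-to-end training with the compiler", 27),
]:
    ratio = speedup_from_percent_faster(pct)
    saved = time_saved_fraction(pct)
    print(f"{label}: {ratio:.2f}x throughput, {saved:.0%} less time")

print(f"distributed training code remaining: {remaining_after_reduction(80):.0%}")
```

Under this reading, a figure above 100% corresponds to slightly more than halving the run time, while the 80% code reduction leaves one fifth of the original code.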
Q:Why does the China Cultural Relics Exchange Center want to cooperate with the Wenxin large model, and what is the goal?
A:The China Cultural Relics Exchange Center chose to cooperate with the Wenxin large model mainly because of its knowledge base and multimodal capabilities. Both parties hope to use artificial intelligence to support the cultural relics sector, with agents that let people learn about the details, historical stories, and deeper value of cultural relics more conveniently, quickly, and comprehensively. The goal is to spread and popularize cultural and historical knowledge more efficiently and to promote the open sharing of cultural resources.
Q:How to use AI to solve problems in the inheritance of intangible cultural heritage martial arts and promote innovation?
A:AI technology is being used to excavate and organize intangible cultural heritage martial arts, such as Tai Chi and Xingyiquan, in a standardized and systematic way. An AI teaching, training, and evaluation model addresses two long-standing problems in martial arts, the lack of professional guidance and the limited number of people it can reach, and so enables digital inheritance. This gives more learners access to intangible cultural heritage martial arts and ensures that ancient martial arts skills can be carried on and promoted in the digital age.
Q:What challenges do customers face in the application of large-scale models?
A:In the application of large models, customers not only need to call complex APIs, but also need to interface with various component tools and perform detailed orchestration. In order to further improve the effectiveness, they may need to adjust and customize specialized models, and consider factors such as computational performance, stability, and security in enterprise applications. This is essentially a system building process, which is both complex and indispensable, but can be accelerated and simplified.
Q:What work has Baidu Intelligent Cloud done in promoting the advancement of AI technology?
A:Baidu Intelligent Cloud is committed to building intelligent infrastructure that reduces trial-and-error costs for customers and partners and makes innovation easier. Last year it released WanYuan, the world's first intelligent computing operating system. Over the past year it has continued to upgrade computing power, models, and applications. For example, the 30,000-card Kunlun chip T8 cluster demonstrates the progress of domestic computing power. In particular, the P800 chip, designed for large models, uses a self-developed XPU architecture with excellent performance and low migration costs.
Q:How does the P800 chip perform in practical applications?
A:The P800 has been successfully applied in the financial industry. At China Merchants Bank, for example, the Kunlun chip T8 provides stable computing power that improves results in scenarios such as intelligent customer service and multimodal data analysis. At the same time, a growing number of central enterprises, universities, and internet companies are deploying P800 computing power at scale. In addition, Kunlun chip has introduced super node technology, which integrates 64 Kunlun chip AI accelerator cards into one machine and significantly improves single-node performance and scalability.
Q:What are the advantages of Baidu Baige platform in computing power management?
A:Baidu Baige is an efficient and stable computing management platform that meets the full-process needs of enterprises. It supports Baidu's own GPU computing power and also manages a large pool of external GPU resources, serving a range of industries. To meet cost and speed requirements for inference, Baidu Baige continuously optimizes inference acceleration, giving customers on the platform a 20x increase in throughput and a 50% increase in inference speed. These gains come from key technologies such as large-scale prefill-decode (PD) disaggregated deployment and multi-level expert parallel optimization.
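PD separation, as the term is generally used, means running the prefill stage (processing the whole prompt) and the decode stage (generating tokens one at a time) in separate worker pools, with the prompt's cached state handed from one to the other. The toy sketch below illustrates only that handoff. It is not Baige's implementation: the `Request` class, the stand-in "cache", and the scheduling loop are all hypothetical, and real systems run the two stages on separate hardware and move the cache over a network.

```python
from collections import deque
from dataclasses import dataclass, field
from typing import Deque, Dict, List


@dataclass
class Request:
    request_id: str
    prompt_tokens: List[str]
    max_new_tokens: int
    kv_cache: List[str] = field(default_factory=list)      # stand-in for key/value state
    output_tokens: List[str] = field(default_factory=list)


def prefill(req: Request) -> Request:
    """Prefill stage: process the whole prompt in one pass and build the cache."""
    req.kv_cache = list(req.prompt_tokens)
    return req


def decode_step(req: Request) -> bool:
    """Decode stage: emit one token from the cache; return True when finished."""
    token = f"tok{len(req.output_tokens)}"   # placeholder for a sampled token
    req.output_tokens.append(token)
    req.kv_cache.append(token)
    return len(req.output_tokens) >= req.max_new_tokens


def serve(requests: List[Request]) -> Dict[str, List[str]]:
    prefill_queue: Deque[Request] = deque(requests)
    decode_queue: Deque[Request] = deque()
    finished: Dict[str, List[str]] = {}
    while prefill_queue or decode_queue:
        # Prefill pool: handle one prompt per round, then hand off its cache.
        if prefill_queue:
            decode_queue.append(prefill(prefill_queue.popleft()))
        # Decode pool: advance every in-flight request by one token.
        for _ in range(len(decode_queue)):
            req = decode_queue.popleft()
            if decode_step(req):
                finished[req.request_id] = req.output_tokens
            else:
                decode_queue.append(req)
    return finished


results = serve([
    Request("a", ["hello", "world"], max_new_tokens=2),
    Request("b", ["hi"], max_new_tokens=3),
])
print(results)
```

Because the two stages have different resource profiles (prefill is compute-heavy, decode is dominated by memory access), keeping them in separate pools lets each be scaled and batched independently, which is the usual motivation for the design.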
Q:How does the Beijing Humanoid Robot Innovation Center make use of Baidu Intelligent Cloud's technology support?
A:The Beijing Humanoid Robot Innovation Center has achieved significant results in artificial intelligence, such as developing the world's first humanoid robot to complete a half marathon. In its exploration and practice of AI, the center relies on the comprehensive AI infrastructure provided by Baidu, including CPU and GPU computing power, embodied-intelligence training acceleration, and capabilities of the Wenxin large models such as natural language understanding, human-machine interaction, and spatial perception. This has driven pilot applications of robots in industrial, service, and educational scenarios, with faster iteration and more efficient deployment.
Q:What models are available on the Qianfan platform? As large models are put into production, is enterprise demand for customized, specialized models increasing?
A:The Qianfan platform already offers more than a hundred models, including Baidu's self-developed Wenxin series and third-party models such as DeepSeek, Llama, Tongyi, and Vidu. These models have all been validated in practical applications, with good results and stable, reliable service. The industry consensus is that, as large models are put into production, enterprise demand for customized and specialized models is steadily increasing. For example, through model distillation, customers can significantly reduce inference costs while maintaining the same effectiveness.
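Model distillation, mentioned in the answer above, trains a small "student" model to imitate a larger "teacher". The minutes do not say which method Qianfan uses, so the sketch below shows only one common textbook formulation: the KL divergence between temperature-softened teacher and student output distributions, scaled by the square of the temperature. The function names and logits are illustrative.

```python
import math
from typing import List


def softmax(logits: List[float], temperature: float = 1.0) -> List[float]:
    """Temperature-scaled softmax, shifted by the max for numerical stability."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(
    teacher_logits: List[float],
    student_logits: List[float],
    temperature: float = 2.0,
) -> float:
    """KL(teacher || student) on softened distributions, scaled by T^2."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)
    return temperature ** 2 * kl


teacher = [4.0, 1.0, 0.5]          # made-up logits for one example
close_student = [3.5, 1.2, 0.4]
far_student = [0.5, 1.0, 4.0]
print(distillation_loss(teacher, close_student))
print(distillation_loss(teacher, far_student))
```

Minimizing this loss over a training set pushes the student toward the teacher's full output distribution rather than only its top answer. In practice it is usually combined with the ordinary task loss.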
Q:How does the Qianfan platform help developers develop models?
A:The Qianfan platform provides a complete toolchain to support developers in developing models; the model distillation case is one example. It also offers two development paradigms, low-code and high-code, and has released an agent framework to meet the needs of different developers.
Q:What key issues do companies need to solve when using intelligent agents? How can a dedicated intelligent agent be customized for specific enterprise needs?
A:When enterprises use intelligent agents, they need to integrate their own private domain data and knowledge bases, orchestrate business processes according to company norms, connect multiple tools, and ensure the high security, stability, and controllability that enterprise-level services require. Taking Sewage Treasure as an example, the intelligent agent Pro on the Qianfan platform can connect to the enterprise knowledge base and execute specific decision instructions, integrate and analyze the company's own database together with public web data, and generate professional reports, thereby meeting the enterprise's specific needs.
Q:How can agents be enabled to easily use internal enterprise data and tools?
A:Through Model Context Protocol (MCP) services, enterprises can easily integrate their own data and tools into an agent, combine them flexibly, and enhance the agent's capabilities. Baidu has achieved full ecosystem compatibility with MCP and has launched the Qianfan Enterprise MCP service, which supports developers in building their own MCP servers and publishing them with one click, making them easier for other developers to use.
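To make the integration idea concrete: MCP (commonly expanded as Model Context Protocol) exchanges JSON-RPC 2.0 messages, and a server exposes tools that an agent discovers with `tools/list` and invokes with `tools/call`. The sketch below handles just those two methods using only the standard library. It is a simplified illustration, not the Qianfan service or an official SDK: the `lookup_order` tool and its data are hypothetical, and the initialization handshake and transport layer are omitted.

```python
import json


def lookup_order(order_id: str) -> str:
    """Hypothetical in-house tool backed by made-up data."""
    orders = {"A100": "shipped", "A101": "processing"}
    return orders.get(order_id, "unknown")


TOOLS = {
    "lookup_order": {
        "description": "Return the status of an internal order (illustrative).",
        "inputSchema": {
            "type": "object",
            "properties": {"order_id": {"type": "string"}},
            "required": ["order_id"],
        },
        "handler": lookup_order,
    }
}


def error(request_id, code: int, message: str) -> str:
    payload = {"jsonrpc": "2.0", "id": request_id,
               "error": {"code": code, "message": message}}
    return json.dumps(payload)


def handle_request(raw: str) -> str:
    """Answer a single JSON-RPC request for tools/list or tools/call."""
    request = json.loads(raw)
    request_id = request.get("id")
    method = request.get("method")
    if method == "tools/list":
        result = {"tools": [
            {"name": name, "description": tool["description"],
             "inputSchema": tool["inputSchema"]}
            for name, tool in TOOLS.items()
        ]}
    elif method == "tools/call":
        params = request.get("params", {})
        tool = TOOLS.get(params.get("name"))
        if tool is None:
            return error(request_id, -32602, "Unknown tool")
        text = tool["handler"](**params.get("arguments", {}))
        result = {"content": [{"type": "text", "text": text}], "isError": False}
    else:
        return error(request_id, -32601, "Method not found")
    return json.dumps({"jsonrpc": "2.0", "id": request_id, "result": result})


call = {"jsonrpc": "2.0", "id": 1, "method": "tools/call",
        "params": {"name": "lookup_order", "arguments": {"order_id": "A100"}}}
print(handle_request(json.dumps(call)))
```

Because every tool is described by a name and an input schema, an agent can discover and call an enterprise's tools without custom glue code for each one, which is the flexible combination the answer describes.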
Q:What work has Minsheng Bank done in building and applying large models?
A:Minsheng Bank treats applying AI to banking as a strategic priority and is promoting application at scale. It has established a strong technical support system consisting of computing power, models, data, and platforms, which has been put into production in more than forty applications. Through close collaboration with Baidu, using tools such as the Qianfan large model platform and Baidu's machine learning platform, the bank has improved the efficiency of, and satisfaction with, knowledge acquisition for all employees, and has also achieved some success in AI for SE (software engineering). Minsheng Bank believes that fine-tuning is essential for maximizing the value of large models and is actively promoting the simultaneous development of large model applications and knowledge management systems.
Q:What stage has the application and development of AI reached in the banking industry? What role does the technical architecture play in AI applications?
A:2025 is regarded as the year of the AI agent, or of the AI era, marking a stage of explosive growth in the breadth and depth of AI applications. When China Minsheng Bank formulated its AI strategy, it concluded that achieving the leap from digitalization to intelligence requires attention to four aspects: talent, training in AI awareness and application skills for all employees, technical architecture, and AI governance. In building an AI-driven technical architecture, banks should draw on the strength of the wider industry to build their own technology systems and capabilities, especially for realizing value in application scenarios. Facing the paradigm shift in technology applications brought by AI, banks need to find a path and method for future development.
Q:What are the challenges and solutions for the application of AI in the financial industry?
A:In the application of AI, generative AI comes with security risks and ethical issues. On one hand, the financial industry must actively embrace large models, and on the other hand, it must strengthen AI governance to ensure its reliability and transparency, establish clear accountability mechanisms, and enhance related capabilities.
Q:What are the functions and application scenarios of "Keyue", the intelligent customer service and marketing service product?
A:"Keyue" is an intelligent customer service product that has been deployed in multiple industries. It provides accurate outreach and interaction with high emotional intelligence, hands off to human customer service, and supports customer segmentation, activation, and win-back throughout the customer lifecycle. For example, when handling a user's cancellation request, it not only streamlines the process but can also identify the user's emotions and offer appropriate product recommendations.
Q:How can "one-click" products help solve problems in the production process of catering chain stores?
A:"One-click" products use AI visual technology to analyze and monitor in real-time whether the production processes, such as making hamburgers, comply with order specifications. If any discrepancies are found, timely warnings will be issued. By simplifying the generation and deployment of AI applications, chain stores can quickly adapt to different situations in various locations and effectively reduce complaints.
Q:How should scenarios, products, and computing power support be chosen when developing AI applications?
A:When developing AI applications, first choose an appropriate scenario and refer to benchmark cases; then design the product and tune and develop it using the capabilities of large models; finally, fine-tune the model and scale the application on efficient, low-cost computing platforms such as Kunlun chip and Baige. Throughout this process, businesses can build their own AI infrastructure to adapt to changing demands and innovation opportunities.