[{"createTime":1735734952000,"id":1,"img":"hwy_ms_500_252.jpeg","link":"https://activity.huaweicloud.com/cps.html?fromacct=261f35b6-af54-4511-a2ca-910fa15905d1&utm_source=V1g3MDY4NTY=&utm_medium=cps&utm_campaign=201905","name":"华为云秒杀","status":9,"txt":"华为云38元秒杀","type":1,"updateTime":1735747411000,"userId":3},{"createTime":1736173885000,"id":2,"img":"txy_480_300.png","link":"https://cloud.tencent.com/act/cps/redirect?redirect=1077&cps_key=edb15096bfff75effaaa8c8bb66138bd&from=console","name":"腾讯云秒杀","status":9,"txt":"腾讯云限量秒杀","type":1,"updateTime":1736173885000,"userId":3},{"createTime":1736177492000,"id":3,"img":"aly_251_140.png","link":"https://www.aliyun.com/minisite/goods?userCode=pwp8kmv3","memo":"","name":"阿里云","status":9,"txt":"阿里云2折起","type":1,"updateTime":1736177492000,"userId":3},{"createTime":1735660800000,"id":4,"img":"vultr_560_300.png","link":"https://www.vultr.com/?ref=9603742-8H","name":"Vultr","status":9,"txt":"Vultr送$100","type":1,"updateTime":1735660800000,"userId":3},{"createTime":1735660800000,"id":5,"img":"jdy_663_320.jpg","link":"https://3.cn/2ay1-e5t","name":"京东云","status":9,"txt":"京东云特惠专区","type":1,"updateTime":1735660800000,"userId":3},{"createTime":1735660800000,"id":6,"img":"new_ads.png","link":"https://www.iodraw.com/ads","name":"发布广告","status":9,"txt":"发布广告","type":1,"updateTime":1735660800000,"userId":3},{"createTime":1735660800000,"id":7,"img":"yun_910_50.png","link":"https://activity.huaweicloud.com/discount_area_v5/index.html?fromacct=261f35b6-af54-4511-a2ca-910fa15905d1&utm_source=aXhpYW95YW5nOA===&utm_medium=cps&utm_campaign=201905","name":"底部","status":9,"txt":"高性能云服务器2折起","type":2,"updateTime":1735660800000,"userId":3}]
<>期待已久之后,终于拿到了文心一言的邀请码,第一时间进行了测试。最后面会讲一下如何获取邀请码。
<>先说一下结论,很远,但是又不远。
很远是因为
:我个人测试得出来的实际效果和ChatGPT差距还很大,下面我会放一些对比。当然也有很多正面例子,回答和ChatGPT相当,甚至中文语境下还好一些。值得肯定。
不远是因为: 作为第一个敢正面硬刚ChatGPT的百度,打响了第一枪,和国内其它各个大厂阿里腾讯头条等等,赶上去需要的只是时间。
注意:我们只测试用中文问答的能力,对比英文的话,对wenxin不太公平?_
<>话不多说,先来看看对比:
wenxin:
ChatGPT:
<>看起来还不错哦,不知道为啥变成英文的了。
<>还有很多测试就不放了,dddd
<>总结一下:
* 基本的检索,然后规整文本输出,文心一言还是可以的。
* 需要稍微有点逻辑的问题就答非所问,大概一半的情况生成不完整的句子。
* 很多常见问题聊天问题,文心一言如果检索不到答案,直接就上兜底策略。
* 国内NLPer暂时不会失业了~
* 第一个吃螃蟹还是比较困难的~
* 我猜想效果不好的原因包括但不限于:国内中文语料库的问题(很多问题),缺乏足够并且好的RHLF,这个需要时间积累,显然赶鸭子上架是不可能的。
* 欢迎评论区补充。
* 其它方面让ChatGPT和文心一言自己来说吧:
<>如何申请邀请码:
**C端用户:**访问 yiyan.baidu.com,点击体验文心
**B端用户:**wenxin.baidu.com, 找到对话API申请,(我是通过B端的申请,然后由于人数限制,暂时给的个人端的权限。)
<>下面是文心一言发来的邀请码邮件最后一段: