你应该知道的事情：▸ 在标准考试中，它还没有完全达到 GPT-4，但与 v1.3 相

首页发布

Lock me up and throw away the key
He knows how to get the best out of me
I'm no force for the world to see
Trade my whole life just to be
Top of the world but I'm still not free
This is a secret that I keep
Until it's gone, I can never find peace
Waste my whole life just to be

https://t.cn/A60wSoQb

Claude-2，Anthropic 对 GPT-4 的射击，已经到来。它比 GPT-4 更便宜，并且在推理和编码方面比旧版强大得多。

你应该知道的事情：
▸ 在标准考试中，它还没有完全达到 GPT-4，但与 v1.3 相比已经快速赶上。括号内获胜者：
GRE 口语：165 vs 169（GPT-4 获胜）
GRE 写作：5 vs 4（克劳德）
GRE 定量：154 vs 163 (GPT-4)
USMLE：~67 与 ~85 (GPT-4)
小节：76.5 vs 75.7（克劳德）

▸ 在推理基准上，
HumanEval 编码：71.2% vs 67%（克劳德获胜）。与此同时，GPT-3.5 的得分仅为 48%。克劳德的编码能力得到了显着的提高。
GSM-8K 小学数学：88% 与 92% (GPT-4)。与之前的版本相比，Claude 从 85.2 提高到 88。

▸ Claude 2（100K 上下文）比 GPT-4-32K 便宜 4-5 倍！假设标记化长度相似，提示代币成本为 11 美元，而每百万美元为 60 美元，完成成本为 32 美元，而每百万美元为 120 美元。

▸ Claude-2 的知识截止时间是 2023 年初，而 GPT-4 的知识截止时间是 2021 年 9 月。所以它的记忆更新鲜。

▸ 脑叶白质切除术非常具有侵略性。 Claude 2 在提供无害响应方面比 v1.3 好 2 倍。 Anthropic 与联盟研究中心 (ARC) 和外部红队成员合作进行安全审计。

▸ 10% 的训练数据是非英语的。

▸ 您可以在 https://t.cn/A60ZFggI 尝试一下。 Claude 的长上下文意味着您可以上传整个论文和代码文件以请求摘要或错误修复。

* 关于标准化考试的重要警告：提示协议可能非常不同，并且大量考试没有错误栏。比较可能不具有统计显着性。

Claude-2, Anthropic's shot at GPT-4, has arrived. It's cheaper than GPT-4 and far stronger in reasoning & coding than its older self.

Things you should know:
▸ On standard exams, it's not quite at GPT-4 yet but catching up fast compared to v1.3. Winner in bracket:
GRE verbal: 165 vs 169 (GPT-4 wins)
GRE writing: 5 vs 4 (Claude)
GRE quantitative: 154 vs 163 (GPT-4)
USMLE: ~67 vs ~85 (GPT-4)
Bar: 76.5 vs 75.7 (Claude)

▸ On reasoning benchmarks,
HumanEval coding: 71.2% vs 67% (Claude wins). Meanwhile GPT-3.5 only scores 48%. Claude's coding ability has improved dramatically.
GSM-8K grade-school math: 88% vs 92% (GPT-4). Claude improves from 85.2 -> 88 vs its prior version.

▸ Claude 2 (100K context) is 4-5x cheaper than GPT-4-32K! Prompt tokens cost $11 vs $60/Million, and completion costs $32 vs $120/M, assuming similar tokenization length.

▸ Claude-2's knowledge cutoff is in early 2023, while GPT-4 is Sept. 2021. So it's got much fresher memory.

▸ Lobotomy is very aggressive. Claude 2 is 2x better at giving harmless responses than v1.3. Anthropic worked with the Alignment Research Center (ARC) and external red teamers for safety audits.

▸ 10 percent of training data is non-English.

▸ You can try it out at https://t.cn/A60wVS17 Claude's long context means that you can upload entire papers and code files to ask for summary or bug fix.

* Big caveat on the standardized exam: the prompting protocols may be very different, and there're no error bars on a large number of exams. The comparison may not be statistically significant. #禅与计算机程序设计艺术#

【#法拉第未来临时首席财务官换人#】
7月13日消息，电动汽车初创企业法拉第未来智能电气 (FFIE) 表示，公司发现 2022 年年报以及部分季报中存在一些错误，将提交重新编制的财报。此外法拉第未来还宣布，公司临时首席财务官 Yun Han 辞职，并继续担任首席会计官一职。法拉第未来任命外部人士乔纳森・马罗科 (Jonathan Maroko) 为新的临时首席财务官，于 7 月 24 日生效。法拉第未来表示，由于发现了一些错误，2022 年年报、截至 2022 年 9 月以及截至 2023 年 3 月的季报“不再可靠”（should no longer be relied upon）。这些错误与公司所发行的某些票据公允价值转换有关。法拉第未来表示，会提交经过重新编制的财务报表，不会影响到电动汽车交付计划。