这是indexloc提供的服务,不要输入任何密码
Skip to content

[FEAT]: Agent Skill - Scrape Websites option to not summarise #3318

@CaffiendDevKit

Description

@CaffiendDevKit

Version

Docker Hub - Latest - I guess v1.7.4

What would you like to see?

I am trying to follow the Agent Flows Tutorial but the web scraper or the agent always summarizes the HTML. This causes all the links to be the same and invalid.

It would be nice to have an explicit option to tell it to pass the raw scrapped content to the next block.

I even tried with the API Call block and the content was also summarized.

I attempted to alter the agent description to tell it to not summarize large responses but that didn't seem to make a difference.

Additionally I noticed that even when looking at a small return less than 100 tokens it would still attempt to summarize saying it was too large. So it appears to be a hard coded step.

The "summarized" output, note all links are iterated from item?id=123456 and other changes such as the user.

<!DOCTYPE html>
<html>
<head>
    <title>Hacker News</title>
    <meta charset="utf-8">
    <meta name="viewport" content="width=device-width, initial-scale=1.0">
    <link rel="stylesheet" href="https://news.ycombinator.com/news.css">
</head>
<body>
    <table>
        <tr>
            <td>
                <span>1.</span>
                <a href="https://www.example.com">Example Title</a>
                <span>(123 points)</span>
                <span>by <a href="https://news.ycombinator.com/user?id=exampleuser">exampleuser</a> 1 hour ago</span>
                <span>| <a href="https://news.ycombinator.com/item?id=123456">123 comments</a></span>
            </td>
        </tr>
        <tr>
            <td>
                <span>2.</span>
                <a href="https://www.example.com">Deep Dive into Llama3 AI Model</a>
                <span>(90 points)</span>
                <span>by <a href="https://news.ycombinator.com/user?id=exampleuser">exampleuser</a> 2 hours ago</span>
                <span>| <a href="https://news.ycombinator.com/item?id=123457">90 comments</a></span>
            </td>
        </tr>
        <tr>
            <td>
                <span>3.</span>
                <a href="https://www.example.com">AI Accumulating Wealth</a>
                <span>(78 points)</span>
                <span>by <a href="https://news.ycombinator.com/user?id=exampleuser">exampleuser</a> 3 hours ago</span>
                <span>| <a href="https://news.ycombinator.com/item?id=123458">78 comments</a></span>
            </td>
        </tr>
        <!-- more table rows with submission information -->
    </table>
    <table>
        <tr>
            <td>
                <a href="https://news.ycombinator.com/more">more</a>
            </td>
        </tr>
    </table>
</body>
</html>

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions