@bsowell bsowell commented Oct 21, 2024

  1. I observed the LLM sometimes having trouble distinguishing where one table ended and the next started, so I added headers (either ELEMENT 1/ELEMENT 2 or DOCUMENT 1/DOCUMENT 2) in the llm_query transform.

  2. Added a default implementation for preprocess_element and postprocess_element in the ElementMerger base class. This just reduces code in some simple sub-classes.

  3. Added code to set table_continuation to True or False. In some cases I got better results when asking the LLM to explain its reasoning, but I didn't want all of that output carried along in the element.

  4. Minor fix to address an invalid escape character warning in a comment.
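
For readers skimming the thread, items 1 and 3 can be illustrated with a rough sketch. This is not the code from this PR: the helper names `label_sections` and `to_bool` are hypothetical, and the sketch assumes the prompt asks the model to end its explanation with the word "true" or "false".

```python
import re


def label_sections(texts, label="ELEMENT"):
    """Prefix each text with a numbered header such as 'ELEMENT 1:'.

    Hypothetical helper: explicit headers give the model a clear
    boundary between one table and the next.
    """
    return "\n\n".join(
        f"{label} {i}:\n{text}" for i, text in enumerate(texts, start=1)
    )


def to_bool(llm_answer):
    """Reduce a free-form LLM answer to True or False.

    Assumption: the model was asked to explain its reasoning and then
    finish with 'true' or 'false', so the last such word is the verdict.
    Falls back to False when no verdict is found.
    """
    matches = re.findall(r"\b(true|false)\b", llm_answer, flags=re.IGNORECASE)
    if not matches:
        return False
    return matches[-1].lower() == "true"


prompt_body = label_sections(["| a | b |", "| c | d |"])
verdict = to_bool("The column layout matches the previous table.\nTrue")
```

Only the boolean `verdict` would then be stored on the element, so the model's explanation can improve the answer without being carried along in the output.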


@dhruvkaliraman7 dhruvkaliraman7 left a comment


LGTM, although I didn't understand item 3. How did adding the boolean help?

@bsowell bsowell merged commit ab9f840 into main Oct 22, 2024
12 of 13 checks passed
@bsowell bsowell deleted the table_merge_tweaks branch October 22, 2024 00:01