Is it possible to expand the size of semi-structured reports dataset?

A lot of RAG based chatbot is based on PDFs like documents, Also how to deal with PDF files is necessary for promoting the performance   
of RAG.  Thanks in advance for your effort of creating the semi-structured dataset for this senarios. But there is so few of the PDFs and the QA pairs, just 6 docs and 30 QA pairs in total. I see in langchain Simth, the [dataset dashboard](https://smith.langchain.com/public/c47d9617-ab99-4d6e-a6e6-92b8daf85a7d/d?tab=0&paginationModel=%7B%22pageIndex%22%3A0%2C%22pageSize%22%3A50%7D&paginationState=%7B%22pageIndex%22%3A0%2C%22pageSize%22%3A10%7D), most of the experiments are in 100% Faithfulness. Is that possible to expand this benchmark? 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Is it possible to expand the size of semi-structured reports dataset? #210

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Is it possible to expand the size of semi-structured reports dataset? #210

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions