Article Data Extraction and Text Mining

フリーミアム
よって Ujeebu | 更新済み 10日前 | Media
人気

9.8 / 10

レイテンシー

3,526ms

サービスレベル

100%

すべてのディスカッションに戻る

Article truncation

avatar
jlehmer
4ヶ月前

Is there a limit to the size of an article you will extract? When I call the extraction endpoint with this url: https://arstechnica.com/science/2022/09/can-we-manipulate-our-mental-capacity-with-music/

Only part of the article text is returned. Any ideas what’s happening here?

avatar
jlehmer commented 24日前

@greyrock Yep agreed. My example still isn’t working either. The API owner hasn’t updated this discussion in months.

@lexper can we please get an update on this?

avatar
greyrock commented 1ヶ月前

Just to reiterate:

https://www.verywellfit.com/essential-yoga-poses-for-beginners-3566747

Stops at the first unordered list/

avatar
jlehmer commented 3ヶ月前

@lexpar I just wanted to follow up on this release. It looks like the truncation problem is still present. Do you have an update on when the release that fixes this will go live?

Thank you

avatar
jlehmer commented 4ヶ月前

@lexper That sounds good, I completely understand. If you could just keep me posted on when the release is live I’d appreciate it.

Thanks for addressing this issue.

avatar
lexper commented 4ヶ月前

Hello,

Apologies for the delay. Engineering are telling me they have detected a new case that’s not handled by our parser and have to properly address it. Our parser works in 95+% of cases, but every now and then we come across a new configuration which requires us to introduce improvements.

They are working on a fix and should be releasing it in a couple of weeks. Unfortunately we cannot release before that.

My sincere apologies.

Best Regards,

avatar
jlehmer commented 4ヶ月前

Hello,

I’m just checking in to see if there’s an update on this.

Thanks

avatar
lexper commented 4ヶ月前

Hi there,

This doesn’t seem to be an issue with the size of the article since the article is of reasonable size. I will escalate the issue to Engineering and get back to you asap.

We apologize for the inconvenience.

Regards,

ディスカッションに参加しましょう-以下にコメントを追加してください:

ログイン/サインアップして新しいコメントを投稿
評価:5-投票:3