Article Data Extraction and Text Mining

GRATIS CON POSSIBILITÀ DI UPGRADE
Da Ujeebu | Aggiornamento 25 days ago | Media
Popolarità

9.8 / 10

Latenza

4,345ms

Livello di servizio

100%

Torna a tutte le discussioni

Article truncation

Rapid account: Jlehmer
jlehmer
6 months ago

Is there a limit to the size of an article you will extract? When I call the extraction endpoint with this url: https://arstechnica.com/science/2022/09/can-we-manipulate-our-mental-capacity-with-music/

Only part of the article text is returned. Any ideas what’s happening here?

Rapid account: Jlehmer
jlehmer Commented 3 months ago

@greyrock Yep agreed. My example still isn’t working either. The API owner hasn’t updated this discussion in months.

@lexper can we please get an update on this?

Rapid account: Greyrock
greyrock Commented 3 months ago

Just to reiterate:

https://www.verywellfit.com/essential-yoga-poses-for-beginners-3566747

Stops at the first unordered list/

Rapid account: Jlehmer
jlehmer Commented 5 months ago

@lexpar I just wanted to follow up on this release. It looks like the truncation problem is still present. Do you have an update on when the release that fixes this will go live?

Thank you

Rapid account: Jlehmer
jlehmer Commented 6 months ago

@lexper That sounds good, I completely understand. If you could just keep me posted on when the release is live I’d appreciate it.

Thanks for addressing this issue.

Rapid account: Lexper
lexper Commented 6 months ago

Hello,

Apologies for the delay. Engineering are telling me they have detected a new case that’s not handled by our parser and have to properly address it. Our parser works in 95+% of cases, but every now and then we come across a new configuration which requires us to introduce improvements.

They are working on a fix and should be releasing it in a couple of weeks. Unfortunately we cannot release before that.

My sincere apologies.

Best Regards,

Rapid account: Jlehmer
jlehmer Commented 6 months ago

Hello,

I’m just checking in to see if there’s an update on this.

Thanks

Rapid account: Lexper
lexper Commented 6 months ago

Hi there,

This doesn’t seem to be an issue with the size of the article since the article is of reasonable size. I will escalate the issue to Engineering and get back to you asap.

We apologize for the inconvenience.

Regards,

Partecipa alla discussione - aggiungi un commento di seguito:

Accedi/Iscriviti per pubblicare nuovi commenti
Valutazione: 5 - Voti: 3