Me And You

Mobilizer fetches wrong page content

Recommended Posts

Hi!

When I use your mobilizer feature for hh.ru RSS feed I get sometimes a part of wrong page code (with error) instead of mobilized content I need.

How to reproduce:

My feed code:

https://hh.ru/search/vacancy/rss?salary=300000&items_on_page=100&order_by=publication_time&specialization=9.289&specialization=9.94&specialization=9.95&specialization=9.78&specialization=9.168&specialization=9.145&specialization=9.562&specialization=9.105&specialization=9.345&specialization=9.448&specialization=9.226&specialization=9.307&specialization=9.22&specialization=9.115&specialization=9.67&specialization=9.139&specialization=9.238&specialization=9.452&specialization=16.194&area=1

Post with problem

Wrong code from line 277 in Chrome incognito tab (error):

Quote

У вас слишком много отобранных вакансий. Вам нужно удалить ненужные вакансии из списка отобранных, чтобы добавить ещё одну.

Удалить самую старую вакансию и добавить эту
Отменить

Correct content that actually must be fetched from line 288 (vacancy description):

Quote

Обязанности:

  • Создание системы управления и контроля сроками реализации проектов инвестиционной программы Общества.
  • Методологическая поддержка системы...

For example, this is correctly mobilized page.

Here is an answer from hh.ru support (in russian, this is not their problem).

 

Thanks.

Share this post


Link to post
Share on other sites

Hello, 

Thank you for this comprehensive feedback. 

We "fine-tuned" the full content fetcher and newly arrived articles should be fetched fine. 

Please monitor the feed in the next couple of days and let us know if you are still receiving these errors for some articles. 

Share this post


Link to post
Share on other sites

It looks like they have some protection which triggers sometimes when the mobilizer tries to fetch the full content. There is a lot of javascripts and cookies if you load the articles directly in browser and the full content fetcher is not browser. It can't behave same way. 

I've tested the articles here and the 1st one was fetched properly but the second one not. 

We'll try to fine tune little bit further but can't promise 100% success. 

 

Share this post


Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now