Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Preliminary Worker Evaluation Tasks #31

Open
mbernst opened this issue Dec 22, 2017 · 2 comments
Open

Preliminary Worker Evaluation Tasks #31

mbernst opened this issue Dec 22, 2017 · 2 comments

Comments

@mbernst
Copy link

mbernst commented Dec 22, 2017

Problem

Requesters have multiple times requested the ability to filter workers based on early performance. This practice of "starter tasks" is a common one: workers' performance on the first few tasks qualifies them to gain access to the rest of the work. Gold standard tasks are a related technique. Concretely, both enable a requester to pitch a subset of their data as training tasks for new workers. Either these have known answers (like gold tasks), or can be manually reviewed by the requester, to gate people into their full task. Both approaches seem like they could be integrated into Boomerang.

Proposal

This proposal suggests that we begin prototyping starter tasks to see if they would help requesters. We would need to answer questions like: is there a way to do this without a lot of extra work on the requester's part? Does this bring substantial benefit to requesters, enough to be worth the time and effort of designing and implementing it? If it proves helpful, we would integrate it into the platform. If not, we could set the idea aside for now. The path through executing this strategic proposal will be determined by our findings.

Implications

Short-term implications: we would gain insight into whether non-expert requesters really benefit from the ability to create starter tasks, or if this is better served by other mechanisms such as prototype tasks.

Long-term implications: This might add to the complexity of the authoring interface. The question will be whether the improved experience is worth that complexity.

Contact

@michael Bernstein on Slack


To officially join in, add yourself as an assignee to the proposal. To break consensus, comment using this template. To find out more about this process, read the how-to.

@mbernst mbernst added this to the Strategy milestone Dec 22, 2017
@mbernst mbernst self-assigned this Dec 22, 2017
@markwhiting markwhiting self-assigned this Dec 22, 2017
@neilthemathguy neilthemathguy self-assigned this Dec 23, 2017
@neilthemathguy
Copy link

neilthemathguy commented Dec 23, 2017

Could you provide more details:

Prototype tasks exist because requesters can be at fault--- they launch unclear task designs that confuse even earnest workers, under-specify edge cases, and neglect to include examples.
Gaikwad et al 2017.

How do we know that these starter tasks will not lead to poor results?

@neilthemathguy neilthemathguy changed the title Starter tasks Preliminary Worker Evaluation Tasks under the umbrella of Prototype Tasks Dec 23, 2017
@mbernst
Copy link
Author

mbernst commented Dec 24, 2017

Yes, the requester need here might be addressed through prototype tasks, or might not --- I am not yet convinced either way.

The details you are asking for, I think:

  1. Requester puts task on Daemo
  2. Task goes through prototype process, succeeds and launches
  3. Workers from across Daemo now do the task

So once we've hit Stage 3 above and the task is fully live on Daemo, the challenge is that a new worker can arrive without ever having done the prototype task. Requesters desire ways to conditionally filter workers into the task based on performance.

One option would be to have all new workers to the task go through the prototype first. Now it's not exactly what prototype tasks were designed for, but it would match research on putting all interviews through the exact same script before making a decision.

If we do this, I would suggest not calling it a prototype task, because what's being described here is the same as prior research (e.g., gold standard tasks), whereas prototype tasks are novel. Adjusting the title to reflect that. We could instead point out how it integrates with prototype tasks, or something like that?

Sorry, that was rambly. Please ask for more precision on anything where I'm confusing.

@mbernst mbernst changed the title Preliminary Worker Evaluation Tasks under the umbrella of Prototype Tasks Preliminary Worker Evaluation Tasks Dec 24, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants