v1 Scheduled Jobs / Third Party Sync Migration #171

mattwilshire · 2024-10-23T11:46:26Z

Adds the ability to create scheduled jobs (currently only for third party syncs) that run off of a cron schedule.
Third party sync configurations are pulled from the application.properties and pushed to a new scheduled_job table.
The ScheduledJobManager handles the creation and deletion of jobs.
The job listener updates the status of each job pre and post execution.
New endpoints for fetching, triggering and toggling scheduled jobs.

…ated in application.properties

…val / updating. Added job and trigger listeners to handle updating the job status before and after execution.

…sert race condition exception.

…owConcurrentExecution annotation can be applied to the job.

… job properties

…isabled.

…t the same time

…vel, Made endpoints require POST request

webapp/src/main/resources/db/migration/V71__Add_scheduled_jobs.sql

byronantak · 2024-10-24T09:35:24Z

webapp/src/main/java/com/box/l10n/mojito/entity/ScheduledJob.java

+  @Column(name = "end_date")
+  private Date endDate;
+
+  @Column(name = "enabled", nullable = false, columnDefinition = "TINYINT DEFAULT 1")


I don't see this column definition used in the repo before. Is it necessary?

Also is the nullable false necessary since you default the field value to false?

Ideally, we would use the "BIT" column type here - as that is the closest semantical equivalent for storing a bool in MySQL and is the smallest amount of space used.

https://dev.mysql.com/doc/refman/8.0/en/bit-type.html

webapp/src/main/java/com/box/l10n/mojito/entity/ScheduledJobStatusEntity.java

byronantak · 2024-10-24T09:45:28Z

webapp/src/main/java/com/box/l10n/mojito/rest/scheduledjob/ScheduledJobWS.java

+    JobKey jobKey = scheduledJobManager.getJobKey(scheduledJob);
+
+    try {
+      if (!scheduledJobManager.getScheduler().checkExists(jobKey))


You get the scheduler 3 distinct times in this method, would it not be possible to store it in a variable and just reuse the result? Or does the scheduler vary a lot and you need to wait to the last possible minute to get it each time?

byronantak · 2024-10-24T09:46:50Z

webapp/src/main/java/com/box/l10n/mojito/rest/scheduledjob/ScheduledJobWS.java

+    }
+  }
+
+  @RequestMapping(method = RequestMethod.POST, value = "/api/jobs/{id}/toggle")


Nit: toggle is a strange term for this. I get that you mean it disable and enable a job, but this name feels confusing

Honestly I have a preference for separate endpoints for enable and disable because this current endpoint expects the called to know what the previous state was. This could make debugging very difficult.

Example: I called toggled on a job, but I used the wrong job's id. Now the output could either be that it is toggle or not toggled depending on its prior state, which we don't know since this was a mistake. It adds a layer of "what was it before? because that dictates what it is now". Where as the if the endpoint is /enable you know if the status code is 200, then it is enabled now. No confusion, no mental gymnastics required

Sounds good, I'll change it to two separate endpoints!

webapp/src/main/java/com/box/l10n/mojito/rest/scheduledjob/ScheduledJobWS.java

byronantak · 2024-10-24T10:08:57Z

webapp/src/main/java/com/box/l10n/mojito/service/scheduledjob/jobs/ScheduledThirdPartySync.java

+  @Autowired ServerConfig serverConfig;
+
+  @Value(
+      "${l10n.scheduledJobs.thirdPartySync.notifications.title:MOJITO | Third party sync failed for {repository}}")


Sorry, I am a bit of a loss here. I know that the ${} syntax loops in the application properties to get the value but how does the {repository} bit work? That's not received from app properties and there doesn't appear to be a variable in scope that it can read from...

If you use ${} in the application properties it will evaluate it on startup, if it doesn't find the variable, the process will crash straight away (IIRC). Other approach is to use {0}, {1} but that relies on the message having those parameters which isn't flexible

See:
String title = StrSubstitutor.replace( notificationTitle, ImmutableMap.of("repository", scheduledJob.getRepository().getName()), "{", "}");

If {repository} is in the custom title it will be replaced with the repository name that did the third party sync.

byronantak · 2024-10-24T10:10:59Z

webapp/src/main/java/com/box/l10n/mojito/service/scheduledjob/jobs/ScheduledThirdPartySync.java

+      "${l10n.scheduledJobs.thirdPartySync.notifications.title:MOJITO | Third party sync failed for {repository}}")
+  String notificationTitle;
+
+  private ScheduledJob scheduledJob;


Also a possible misunderstanding but why do we need to store these variables as class level variables? Does the context not have the details to find them again? 🤔

I don't know this library but I'm mainly worried about side-effects.

The notification title is pulled in from the application properties, you can have different PagerDuty titles for different environments. The ScheduledJob here is set on execution, we pull it out of the database on the execute method, when the success or failure method is called by the listener we can reference the job as its tied to the instance.

byronantak · 2024-10-24T10:12:40Z

webapp/src/main/java/com/box/l10n/mojito/service/scheduledjob/jobs/ScheduledThirdPartySync.java

+  @Override
+  public void execute(JobExecutionContext jobExecutionContext) throws JobExecutionException {
+    // Fetch the scheduled job and cast the properties
+    scheduledJob = scheduledJobRepository.findByJobKey(jobExecutionContext.getJobDetail().getKey());


Should this not be the try as well?
What if the cast fails or if the key is not found (by some misfortune)

The only way for this not to exist if there was some manual altering of the database, the manager creates a job for reach scheduled job so it must exist otherwise it would never even get to this execute method.

webapp/src/main/java/com/box/l10n/mojito/service/scheduledjob/jobs/ScheduledThirdPartySync.java

byronantak · 2024-10-24T10:17:38Z

webapp/src/main/java/com/box/l10n/mojito/service/scheduledjob/jobs/ScheduledThirdPartySync.java

+              try {
+                pd.triggerIncident(scheduledJob.getId(), payload);
+              } catch (PagerDutyException e) {
+                logger.error(


You take a lot of time and effort to build up all those useful urls above and if the incident fails to trigger, then you just discard it?
Can we not log it too to save us the trouble of reverse engineering the links? (It might be included in the payload, I'm not sure. Just the thought which was triggered)

webapp/src/main/java/com/box/l10n/mojito/service/scheduledjob/ScheduledJobStatusRepository.java

…'t do

garionpin · 2024-10-25T14:47:09Z

webapp/src/main/java/com/box/l10n/mojito/entity/ScheduledJob.java

+  @Column(name = "cron")
+  private String cron;
+
+  @Transient private ScheduledJobProperties properties;


Out of curiosity - is the use of @transient here to do with some desired serialization behaviour?

Yes that's exactly what its for in this instance, the transient here indicates to the JPA that the field should not be persisted to the database and when the row is pulled from the db, the deserializeProperties method in the entity is called as it has the PostLoad annotation. The properties string from the db is deserialized into this transient field. The setter of this transient field converts itself back into a JSON string which is stored in the propertiesString field that is persisted to the db.

webapp/src/main/java/com/box/l10n/mojito/entity/ScheduledJob.java

webapp/src/main/resources/db/migration/V71__Add_scheduled_jobs.sql

garionpin · 2024-10-25T15:00:28Z

webapp/src/main/resources/db/migration/V71__Add_scheduled_jobs.sql

+INSERT INTO scheduled_job_type(name) VALUES('THIRD_PARTY_SYNC');
+INSERT INTO scheduled_job_status_type(name) VALUES('SCHEDULED'), ('IN_PROGRESS'), ('FAILED'), ('SUCCEEDED');
+
+CREATE TABLE scheduled_job_aud (


Small note to track that changes to the base table column types also get reflected the _aud version

webapp/src/main/java/com/box/l10n/mojito/entity/ScheduledJob.java

garionpin · 2024-10-25T15:12:29Z

webapp/src/main/java/com/box/l10n/mojito/entity/ScheduledJobTypeEntity.java

+@Entity
+@Table(name = "scheduled_job_type")
+@Audited(targetAuditMode = NOT_AUDITED)
+public class ScheduledJobTypeEntity extends BaseEntity {


For the actual entity class names - we don't tend to suffix them with the word "Entity", e.g.:
public class Locale extends BaseEntity (vs. LocaleEntity)

I ended up using Entity for the suffix here as I have another class labelled ScheduledJobType in another package that I created (You will see it further on in the PR). Would the best approach be to rename that ScheduledJobType to something else so that the entity version can use it ? I'm thinking of calling the current one : ScheduledJobTypeEnum as the enum currently uses that name.

garionpin · 2024-10-25T15:23:05Z

webapp/src/main/java/com/box/l10n/mojito/rest/scheduledjob/ScheduledJobWS.java

+  public ResponseEntity<ScheduledJobResponse> triggerJob(@PathVariable String id) {
+


The id here should probably be of type long instead of string

The id here is a UUID. However, I have changed the data type here to be a UUID so that Spring will respond with a 400 Bad Request if it isn't a valid UUID passed through.

mattwilshire added 27 commits October 22, 2024 10:43

Added a scheduled job manager to handle the creation of cron jobs loc…

0d314b6

…ated in application.properties

Added main logic, storing jobs to database and parsing them on retrie…

7aaed92

…val / updating. Added job and trigger listeners to handle updating the job status before and after execution.

Added ScheduledJobDTO for API endpoints and handled a possible row in…

9edb9fe

…sert race condition exception.

Fix failing maven tests.

fd73ff5

Added SQL migration. Removed isAllowedExecutionOverlap, as the Disall…

3fb087b

…owConcurrentExecution annotation can be applied to the job.

Changed conditional property to match current quartz condition.

2b17972

Attached the job properties to the Job Type in the enum definition

d1cba21

Added Slack Notification for failed third party sync

3e49061

Fix failing maven tests

6a6d61c

Trigger mvn tests again

2c2d80c

Fix maven tests ?

6bbe94d

Fix exception thrown if @value is missing

ae0fff4

Changed condition on property value

8dea68d

Send pollable task ID in notification and other small changes.

427b4f0

Denormalized job type and job status. Added version to base scheduled…

14ba12b

… job properties

Cleanup & Set end / start date to null depending on current job status

4aeaccf

Added more ScheduledJob endpoints and veto the job execution if its d…

85f141d

…isabled.

Added ScheduledJobResponse for structured API responses

30dac28

Use UUID for job id

9a42a6b

Fix failing tests

4ce23ff

Fix deadlock that would occur in audit table when two syncs started a…

3efecaf

…t the same time

Clean up map

246597e

Refactoring & Added PagerDuty incident creation on job failure

348a15d

Add flyway schema & Add PD incident title to configuration

bcaef3f

Added configuration examples, Fix deadlock occurring at the method le…

f44b540

…vel, Made endpoints require POST request

Comment changes

40ddd6f

Add starting tests, ensuring they work in HSQL

7f41168

mattwilshire requested review from maallen, byronantak and garionpin October 23, 2024 11:46

byronantak reviewed Oct 24, 2024

View reviewed changes

webapp/src/main/resources/db/migration/V71__Add_scheduled_jobs.sql Outdated Show resolved Hide resolved

byronantak reviewed Oct 24, 2024

View reviewed changes

webapp/src/main/java/com/box/l10n/mojito/entity/ScheduledJobStatusEntity.java Show resolved Hide resolved

byronantak reviewed Oct 24, 2024

View reviewed changes

webapp/src/main/java/com/box/l10n/mojito/rest/scheduledjob/ScheduledJobWS.java Show resolved Hide resolved

byronantak reviewed Oct 24, 2024

View reviewed changes

webapp/src/main/java/com/box/l10n/mojito/service/scheduledjob/jobs/ScheduledThirdPartySync.java Show resolved Hide resolved

byronantak reviewed Oct 24, 2024

View reviewed changes

webapp/src/main/java/com/box/l10n/mojito/service/scheduledjob/ScheduledJobStatusRepository.java Show resolved Hide resolved

mattwilshire added 3 commits October 25, 2024 12:38

Fix tests by not using new application context

fff8a9f

Fix tests by manually adding rows flyway would handle that HSQL doesn…

fc8cdaf

…'t do

Review comments to fix up SQL schema

682c2bc