Compilation of record functional modification is costly #6217
Comments
Comment author: @damiendoligez There is another problem with this: if the record is immutable, it's possible that, in the near future, the call to [caml_modify] will break the GC's invariants.
Comment author: bobot Instead of modifying the current heuristic, and to take Damien's comment into account, can't we just replace the setfield, since we are sure that the values assigned are older than the duplicated record?
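The two strategies under discussion can be mimicked at the source level with the Obj module. This is only an illustrative sketch (the names update_fieldwise and update_dup are hypothetical, and using Obj on immutable records is unsafe in general); the write performed by Obj.set_field is the kind of store that goes through the caml_modify write barrier when the block lives on the major heap:

```ocaml
type t = { a : int; b : int; c : int }

(* Strategy 1: allocate a fresh record and copy each field,
   substituting the updated one. This is what the fieldwise
   compilation scheme does. *)
let update_fieldwise r v = { r with a = v }

(* Strategy 2: duplicate the whole block, then overwrite one field.
   The store into the duplicate is the one that needs the write
   barrier (caml_modify) if the duplicate is on the major heap.
   Unsafe Obj-based sketch, for illustration only. *)
let update_dup r v =
  let o = Obj.dup (Obj.repr r) in
  Obj.set_field o 0 (Obj.repr (v : int));
  (Obj.obj o : t)
```

Both functions compute the same result; the discussion above is only about which machine code the compiler should emit for the update.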
Comment author: @gasche Note: François Pottier also hit this limit when experimenting with converting the Menhir parser generator from a mutable field update to a pure record update. While the update comes at essentially no cost when the parser state is a record of 5 fields, changing it to 6 fields gives an overall 30% performance overhead (in the generated parsers).
Comment author: @damiendoligez @bobot, it's not enough to be sure that the assigned values are older than the record; you also have to be sure that the record is in the minor heap.
Comment author: madroach @doligez
Comment author: Otini I have run some benchmarks, and the fieldwise copy is indeed always faster as long as the new record is allocated on the minor heap, that is, as long as it has fewer than 256 fields. For bigger records (up to 513 fields), the second method (Obj.dup followed by caml_modify) is 1.5 to 4 times faster.
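Otini's benchmark code is not included in the thread; a minimal sketch of this kind of measurement (timing functional updates in a loop, with hypothetical names, and far cruder than a real benchmark should be) might look like:

```ocaml
type small = { s0 : int; s1 : int; s2 : int; s3 : int; s4 : int }

(* Time n evaluations of a thunk, returning elapsed CPU seconds.
   Sys.opaque_identity keeps the compiler from optimizing the
   allocation away. Illustrative micro-benchmark only. *)
let time n f =
  let t0 = Sys.time () in
  for _ = 1 to n do
    ignore (Sys.opaque_identity (f ()))
  done;
  Sys.time () -. t0

let () =
  let r = { s0 = 0; s1 = 1; s2 = 2; s3 = 3; s4 = 4 } in
  let t = time 1_000_000 (fun () -> { r with s0 = 42 }) in
  Printf.printf "1M functional updates of a 5-field record: %.3fs\n" t
```

To reproduce the comparison in the comment above, the same loop would be run against records of growing sizes, so that the update crosses the minor-heap allocation threshold.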
Comment author: @gasche This issue was solved by merging Otini's pull request into trunk (b3216f0). The immutable field-by-field copy is now always preferred below max_young_wosize (the maximal size of objects allocated on the minor heap), which is currently 256. (The pull request is a bit noisy, with three commits instead of one; apologies for forgetting to rebase.)
Original bug ID: 6217
Reporter: @ppedrot
Assigned to: @gasche
Status: resolved (set by @gasche on 2016-06-26T15:39:23Z)
Resolution: fixed
Priority: normal
Severity: tweak
Version: 4.01.0
Target version: 4.03.1+dev
Fixed in version: 4.04.0 +dev / +beta1 / +beta2
Category: back end (clambda to assembly)
Tags: junior_job
Monitored by: @bobzhang @gasche @hcarty
Bug description
Currently, compilation of the { r with ... } expression seems to have a length threshold that triggers a change in how it is compiled. For instance,
type t = {
  t1 : int option;
  t2 : int option;
  t3 : int option;
  t4 : int option;
  t5 : int option;
}

let set1 x t = { t with t1 = Some x }
produces a direct allocation on the minor heap, together with a handful of assembly moves that copy the record fieldwise. The problem is that adding a [t6 : int option] field changes this to an [Obj.dup] followed by a [caml_modify].
I don't know whether this threshold was chosen based on benchmarks, but [caml_modify] is really costly, and, for the record (no pun intended), this overhead was indeed observable in Coq. Maybe this threshold should be increased?
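To make the threshold crossing concrete, here is the same example extended with a sixth field; per the description above, this is the variant that the pre-fix heuristic compiles to Obj.dup plus caml_modify (the emitted code can be inspected, e.g., via ocamlopt's -dcmm output; the semantics are of course unchanged):

```ocaml
(* Same record as above, with one extra field: crossing the
   5-field threshold described in this report switches the
   compilation scheme for { t with ... }. *)
type t6 = {
  t1 : int option;
  t2 : int option;
  t3 : int option;
  t4 : int option;
  t5 : int option;
  t6 : int option;
}

let set1 x t = { t with t1 = Some x }
```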