From: "Milan Stanojević" <milanst@gmail.com>
To: Martin Jambon <martin.jambon@ens-lyon.org>
Cc: Jeremy Yallop <yallop@gmail.com>, Caml List <caml-list@inria.fr>
Subject: Re: [Caml-list] segfault in simple program with 4.02 native
Date: Sat, 6 Sep 2014 02:00:40 -0400
Message-ID: <CAKR7PS_brNtfaYmL+tSFCt=DnRX3jaQ=+5x-98pp=5JqXpKXtg@mail.gmail.com>
In-Reply-To: <540AA0DB.1040202@ens-lyon.org>
Could you do a dirty trick where you define a record that is the same
as the one you have now except that it has mutable fields, do your
parsing as you do now, and at the end return a record with immutable
fields (using (Obj.magic mutable : immutable))? You just need to make
sure that your mutable record doesn't escape your code.
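
To make the idea concrete, here is a minimal sketch. The types `t` and
`t_mut`, the function `of_fields`, and the (key, value) string input are
all made up for illustration and are not atdgen's generated code. It
assumes both record definitions keep the same fields in the same order,
so they share one runtime layout.

```ocaml
(* Public, immutable record handed to callers. *)
type t = { name : string; count : int }

(* Private twin: same field order and types, but mutable fields.
   Assumption: both definitions are kept in sync by the generator. *)
type t_mut = { mutable name_m : string; mutable count_m : int }

(* Hypothetical parser over (key, value) pairs arriving in any order. *)
let of_fields (fields : (string * string) list) : t =
  (* Well-typed placeholders; mutable fields are not constant-propagated. *)
  let r = { name_m = ""; count_m = 0 } in
  let seen_count = ref false in
  List.iter
    (fun (k, v) ->
      match k with
      | "name" -> r.name_m <- v
      | "count" -> r.count_m <- int_of_string v; seen_count := true
      | _ -> ())
    fields;
  if not !seen_count then failwith "missing field: count";
  (* Unchecked cast: only reasonable while the two definitions match and
     [r] is never touched again after this point. *)
  (Obj.magic r : t)
```

The cast is still outside what the type system guarantees, so it would
have to stay confined to generated code where the mutable value never
escapes.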
On Sat, Sep 6, 2014 at 1:51 AM, Martin Jambon
<martin.jambon@ens-lyon.org> wrote:
> On Fri 05 Sep 2014 05:12:44 PM PDT, Jeremy Yallop wrote:
>>
>> On 6 September 2014 00:39, Martin Jambon <martin.jambon@ens-lyon.org>
>> wrote:
>>>
>>> That code is generated by atdgen. What happens is that we have to either
>>> create an empty record when starting to parse a list of unordered JSON
>>> fields, or use a bunch of `let <field name> = ref None in` bindings, one
>>> for each field, and create the record in the end. While the latter
>>> approach is not much more work to implement, the resulting code was found
>>> to be significantly slower.
>>>
>>> The reason why it's using `Obj.magic 0.0` is that it worked in all cases
>>> (and has for the past 4 years). Obtaining a well-formed constant value
>>> for any type is not trivial, so this is what we have.
>>>
>>> It's very possible that it's now broken with OCaml 4.02. First try a
>>> 'make test' from atdgen's source directory
>>> (https://github.com/mjambon/atdgen) and see if it passes.
>>
>>
>> It does seem to be broken, and the change in behaviour with 4.02 is
>> apparently due to improved constant propagation
>> (http://caml.inria.fr/mantis/view.php?id=5779).
>>
>> The compiler now takes more advantage of immutability to improve the
>> memory usage and performance of programs. It's safe (or ought to be
>> safe) to assume that immutable record fields are never updated, so the
>> values used to initialize the fields can be propagated to other parts
>> of the program. Here's a small example that shows the change in
>> behaviour between 4.01 and 4.02.
>>
>> type t = { s : string }
>> let x = { s = "one" }
>> let () = Obj.(set_field (repr x) 0 (repr "two"))
>> let () = print_endline x.s
>>
>> Using OCaml 4.01 the third line overwrites the field 's' and the
>> fourth line reads the updated field and prints "two". Using OCaml
>> 4.02 the initial value of the field is propagated past the write to
>> the code in the fourth line, so the program prints "one".
>>
>> The code currently generated by atdgen assumes that it's safe to treat
>> fields as if they were mutable -- that is, it assumes that it's safe
>> to initialize a field with a value of the wrong type, so long as the
>> value is overwritten before the field is first read. I don't think
>> such tricks were ever explicitly guaranteed to work, but they're now
>> much more likely to fail, leading to the dummy initial value being
>> accessed at an inappropriate type.
>
>
> Thanks for the explanation, Jeremy. I guess atdgen will have to use "option
> refs" after all unless someone has a better idea.
>
> ATD definition:
>
> type t = {
> ?field0: foo option;
> ~field1: string;
> field2: int;
> }
>
> Generated OCaml code:
>
> let field0 = ref None in
> let field1 = ref "" in
> let field2 = ref None in
> ...
> (* parse json fields coming in an unknown order *)
> ...
> {
> field0 = !field0;
> field1 = !field1;
> field2 = (match !field2 with None -> error ... | Some x -> x);
>
> }
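
For reference, a self-contained version of the "option refs" outline
quoted above might look like the following. This is only a sketch: the
`value` type, the `of_fields` function, and the use of `int` in place of
`foo` are stand-ins for illustration, and the input is a plain list of
(key, value) pairs rather than a real JSON lexer.

```ocaml
(* OCaml counterpart of the ATD record; [int] stands in for [foo]. *)
type t = { field0 : int option; field1 : string; field2 : int }

(* Hypothetical stand-in for decoded JSON values. *)
type value = Int of int | String of string

let of_fields (fields : (string * value) list) : t =
  let field0 = ref None in   (* optional: absent means None *)
  let field1 = ref "" in     (* defaulted: absent means "" *)
  let field2 = ref None in   (* required: must be seen at least once *)
  List.iter
    (fun (k, v) ->
      match k, v with
      | "field0", Int n -> field0 := Some n
      | "field1", String s -> field1 := s
      | "field2", Int n -> field2 := Some n
      | _ -> ())  (* unknown or ill-typed fields are ignored in this sketch *)
    fields;
  (* The immutable record is built once, after all fields are known. *)
  {
    field0 = !field0;
    field1 = !field1;
    field2 =
      (match !field2 with
       | None -> failwith "missing field: field2"
       | Some x -> x);
  }
```

No field ever holds a value of the wrong type here, so nothing depends
on the compiler leaving immutable fields alone.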