* Type visibility limitation in recursive modules
@ 2007-12-23 14:24 "Márk S. Zoltán"
2007-12-23 17:20 ` [Caml-list] " Peng Zang
2007-12-24 11:44 ` "Márk S. Zoltán"
0 siblings, 2 replies; 8+ messages in thread
From: "Márk S. Zoltán" @ 2007-12-23 14:24 UTC (permalink / raw)
To: caml-list
Behold the following code snippet:
module rec Aleph :
sig
type t = Tag of Beth.t
end =
struct
type t = Tag of Beth.t
end
and Beth :
sig
type t
val scandalous : Aleph.t
end =
struct
type t = Aleph.t list
let scandalous = Aleph.Tag((* HERE BE DRAGONS *) [])
end
The compiler complains that the empty list after HERE BE DRAGONS has
type 'a list but it is used with type Beth.t. I'd expect the compiler to
discover that those types can be unified via 'a == Aleph.t, since as far
as I can tell, all necessary facts are visible at that point. Replacing
"Beth : sig type t ..." with "Beth : sig type t = Aleph.t list ..." does
away with the error, but I would very much like to keep Beth.t an opaque
type. Is this an intended behavior? And if it is intended, is there a
way to tell the compiler what I really want it to do?
Thanks in advance for any help.
Z-
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [Caml-list] Type visibility limitation in recursive modules
2007-12-23 14:24 Type visibility limitation in recursive modules "Márk S. Zoltán"
@ 2007-12-23 17:20 ` Peng Zang
2007-12-23 18:00 ` "Márk S. Zoltán"
2007-12-24 11:44 ` "Márk S. Zoltán"
1 sibling, 1 reply; 8+ messages in thread
From: Peng Zang @ 2007-12-23 17:20 UTC (permalink / raw)
To: caml-list; +Cc: Márk S. Zoltán
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
I do not know if this is the intended behavior. It may very well be.
However, it appears we can easily work around this by simply abstracting the
type later. Eg.
module rec Aleph :
sig
type t = Tag of Beth.t
end =
struct
type t = Tag of Beth.t
end
and Beth :
sig
type t = Aleph.t list
val scandalous : Aleph.t
end =
struct
type t = Aleph.t list
let scandalous = Aleph.Tag((* HERE BE DRAGONS *) [])
end
module Beth = (Beth:sig type t val scandalous : Aleph.t end)
# Beth.scandalous;;
- - : Aleph.t = Aleph.Tag []
# Aleph.Tag [];;
- - : Aleph.t = Aleph.Tag []
# (Aleph.Tag []) = Beth.scandalous;;
- - : bool = true
Happy Holidays,
Peng
On Sunday 23 December 2007 09:24:55 am Márk S. Zoltán wrote:
> Behold the following code snippet:
>
> module rec Aleph :
> sig
> type t = Tag of Beth.t
> end =
> struct
> type t = Tag of Beth.t
> end
> and Beth :
> sig
> type t
> val scandalous : Aleph.t
> end =
> struct
> type t = Aleph.t list
> let scandalous = Aleph.Tag((* HERE BE DRAGONS *) [])
> end
>
> The compiler complains that the empty list after HERE BE DRAGONS has
> type 'a list but it is used with type Beth.t. I'd expect the compiler to
> discover that those types can be unified via 'a == Aleph.t, since as far
> as I can tell, all necessary facts are visible at that point. Replacing
> "Beth : sig type t ..." with "Beth : sig type t = Aleph.t list ..." does
> away with the error, but I would very much like to keep Beth.t an opaque
> type. Is this an intended behavior? And if it is intended, is there a
> way to tell the compiler what I really want it to do?
>
> Thanks in advance for any help.
>
> Z-
>
> _______________________________________________
> Caml-list mailing list. Subscription management:
> http://yquem.inria.fr/cgi-bin/mailman/listinfo/caml-list
> Archives: http://caml.inria.fr
> Beginner's list: http://groups.yahoo.com/group/ocaml_beginners
> Bug reports: http://caml.inria.fr/bin/caml-bugs
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.7 (GNU/Linux)
iD8DBQFHbpj4fIRcEFL/JewRAiRrAJ9lkWenFBHkG2IwRTuRDVN92JCRPQCgjjZV
INwvriBZ/NF25J3ApX8lKn8=
=UpHJ
-----END PGP SIGNATURE-----
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [Caml-list] Type visibility limitation in recursive modules
2007-12-23 17:20 ` [Caml-list] " Peng Zang
@ 2007-12-23 18:00 ` "Márk S. Zoltán"
0 siblings, 0 replies; 8+ messages in thread
From: "Márk S. Zoltán" @ 2007-12-23 18:00 UTC (permalink / raw)
To: Caml Mailing List
Peng Zang wrote:
> I do not know if this is the intended behavior. It may very well be.
> However, it appears we can easily work around this by simply abstracting the
> type later. Eg.
>
> module rec Aleph :
> sig
> type t = Tag of Beth.t
> end =
> struct
> type t = Tag of Beth.t
> end
> and Beth :
> sig
> type t = Aleph.t list
> val scandalous : Aleph.t
> end =
> struct
> type t = Aleph.t list
> let scandalous = Aleph.Tag((* HERE BE DRAGONS *) [])
> end
>
> module Beth = (Beth:sig type t val scandalous : Aleph.t end)
>
>
> # Beth.scandalous;;
> - - : Aleph.t = Aleph.Tag []
> # Aleph.Tag [];;
> - - : Aleph.t = Aleph.Tag []
> # (Aleph.Tag []) = Beth.scandalous;;
> - - : bool = true
>
> Happy Holidays,
>
> Peng
>
>
Agreed, this does work for the code on the outside, but I wanted to hide
the actual gory details of Beth.t from the implementation of Aleph too.
The real-life Aleph contains a huge mess of inherited code (inherited
from myself, but I wrote it years ago), plus some serious potential for
quasi-macaronic growth. I wanted to make sure that none of that code
makes assumptions about Beth.t, because the choice of Beth.t could
easily have serious performance implications, and I would like to
isolate the rest of the code from the impact of Beth.t turning overnight
into an Aleph.t array or worse. That can only be done if the
implementation of Beth is the only place where the details of Beth.t are
known.
This is not exactly a showstopper issue for me, though. The reason I
posted it was that (1) I suspect this is a bug (more likely a design
bug, not an implementation one), and I like finding bugs, or (2) this is
a pointer into something I really should understand about modules, and
with any help I will understand it soon, as more replies come in.
Happy Holidays,
Z-
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [Caml-list] Type visibility limitation in recursive modules
2007-12-23 14:24 Type visibility limitation in recursive modules "Márk S. Zoltán"
2007-12-23 17:20 ` [Caml-list] " Peng Zang
@ 2007-12-24 11:44 ` "Márk S. Zoltán"
2007-12-27 23:13 ` "Márk S. Zoltán"
1 sibling, 1 reply; 8+ messages in thread
From: "Márk S. Zoltán" @ 2007-12-24 11:44 UTC (permalink / raw)
To: Caml Mailing List
Márk S. Zoltán wrote:
> Behold the following code snippet:
>
> module rec Aleph :
> sig
> type t = Tag of Beth.t
> end =
> struct
> type t = Tag of Beth.t
> end
> and Beth :
> sig
> type t
> val scandalous : Aleph.t
> end =
> struct
> type t = Aleph.t list
> let scandalous = Aleph.Tag((* HERE BE DRAGONS *) [])
> end
>
> The compiler complains that the empty list after HERE BE DRAGONS has
> type 'a list but it is used with type Beth.t. I'd expect the compiler
> to discover that those types can be unified via 'a == Aleph.t, since
> as far as I can tell, all necessary facts are visible at that point.
> Replacing "Beth : sig type t ..." with "Beth : sig type t = Aleph.t
> list ..." does away with the error, but I would very much like to keep
> Beth.t an opaque type. Is this an intended behavior? And if it is
> intended, is there a way to tell the compiler what I really want it to
> do?
>
> Thanks in advance for any help.
>
> Z-
The plot thickens. The following code provokes an error message stating
explicitly that type t in Beth is not the same as Beth.t:
----------------------
module rec Aleph :
sig
type t = Tag of Beth.t
end =
struct
type t = Tag of Beth.t
end
and Beth :
sig
type t
val a_beth_t_value : t
val scandalous : Aleph.t
end =
struct
type t = Aleph.t list
let a_beth_t_value = ([] : t)
let scandalous = Aleph.Tag(a_beth_t_value)
end
----------------------
$ ocamlc -i Anomaly.ml
File "Anomaly.ml", line 18, characters 33-49:
This expression has type t = Aleph.t list but is here used with type Beth.t
---------------------
The error message references the mention of a_beth_value in the let
scandalous ... line. The ([] : t) construct in the previous line was
necessary to trigger the 'This expression has type t = Aleph.t list ...'
message, since a bare [] only triggers "This expression has type Aleph.t
list but ...", without an indication that it knows that type t is
Aleph.t list.
However, reexporting a_beth_t_value from Aleph satisfies the compiler:
--------------------
module rec Aleph :
sig
type t = Tag of Beth.t
val a_reexported_beth_t_value : Beth.t
end =
struct
type t = Tag of Beth.t
let a_reexported_beth_t_value = Beth.a_beth_t_value
end
and Beth :
sig
type t
val a_beth_t_value : t
val scandalous : Aleph.t
end =
struct
type t = Aleph.t list
let a_beth_t_value = []
let scandalous = Aleph.Tag(Aleph.a_reexported_beth_t_value)
end
--------------------
$ ocamlc -i Anomaly.ml
module rec Aleph :
sig type t = Tag of Beth.t val a_reexported_beth_t_value : Beth.t end
and Beth : sig type t val a_beth_t_value : t val scandalous : Aleph.t end
--------------------
So far it sounds like the compiler acknowledges there is a type Beth.t
and identifies it with type t in the sig of Beth, but fails to make the
connection between this type and the Aleph.t list type expression inside
the struct of Beth. But even that is only true if the type expression in
question uses a built-in list; if I replace it with an algebraic type
implementing my own list, even the original example starts working.
-------------------
module rec Aleph :
sig
type t = Tag of Beth.t
end =
struct
type t = Tag of Beth.t
end
and Beth :
sig
type t
val scandalous : Aleph.t
end =
struct
type t = Cons of Aleph.t * t | Nil
let scandalous = Aleph.Tag(Nil)
end
-----------------
$ ocamlc -i Anomaly.ml
module rec Aleph : sig type t = Tag of Beth.t end
and Beth : sig type t val scandalous : Aleph.t end
-----------------
This is a bug, isn't it?
Cheers
Z-
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [Caml-list] Type visibility limitation in recursive modules
2007-12-24 11:44 ` "Márk S. Zoltán"
@ 2007-12-27 23:13 ` "Márk S. Zoltán"
2007-12-28 10:02 ` Keiko Nakata
0 siblings, 1 reply; 8+ messages in thread
From: "Márk S. Zoltán" @ 2007-12-27 23:13 UTC (permalink / raw)
To: Caml Mailing List
The problem only impacts types declared via parametrized type
constructors. To wit:
This does not work:
----------------------------------------
module rec Aleph : sig type t = Tag of Beth.t end =
struct type t = Tag of Beth.t end
and Beth : sig type t val v : Aleph.t end =
struct type t = Aleph.t list let v = Aleph.Tag([]) end
This does:
----------------------------------------
module rec Aleph : sig type t = Tag of Beth.t end =
struct type t = Tag of Beth.t end
and Beth : sig type t val v : Aleph.t end =
struct type t = Nil | Cons of Aleph.t * t let v = Aleph.Tag(Nil) end
And this does not:
----------------------------------------
type 'a mylist = Nil | Cons of 'a * 'a mylist
module rec Aleph : sig type t = Tag of Beth.t end =
struct type t = Tag of Beth.t end
and Beth : sig type t val v : Aleph.t end =
struct type t = Aleph.t mylist let v = Aleph.Tag(Nil) end
I have spent a few (dozen) hours debugging the compiler, and came to the
conclusion that the reason for this bug is that "strenghtening" does not
take place for Tconstr's. Tconstr types use their type_manifest fields
to hold a description of the type, while Tvariant types set it to None.
When the typing process of recursive modules reaches the strenghtening
phase, it dutifully avoids updating Tconstr's because their
type_manifest matches Some _, not None. As a result, Tconstr type
implementations cannot be hidden inside a recursive module.
This issue has other interesting variations, e.g. this works:
-----------------------------------------------
module type AT = sig type t val v : t end
module type S = sig module Aleph : AT end
module Make = functor(A : AT) ->
(struct module Aleph = A end : S with module Aleph = A)
module rec Al : AT =
struct type t = string let v = "v" end
and Ex : S with module Aleph = Al = Make(Al)
This doesn't:
-----------------------------------------------
module type AT = sig type t val v : t end
module type S = sig module Aleph : AT end
module Make = functor(A : AT) ->
(struct module Aleph = A end : S with module Aleph = A)
module rec Al : sig type t val v : t end (* NB: AT!!! *) =
struct type t = string let v = "v" end
and Ex : S with module Aleph = Al = Make(Al)
-----------------------------------------------------
File "Anomaly.ml", line 34, characters 9-33:
In this `with' constraint, the new definition of Aleph
does not match its original definition in the constrained signature:
Modules do not match: sig type t = Al.t end is not included in AT
The field `v' is required but not provided
----------------------------------------------------
Cheers
Z-
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [Caml-list] Type visibility limitation in recursive modules
2007-12-27 23:13 ` "Márk S. Zoltán"
@ 2007-12-28 10:02 ` Keiko Nakata
2007-12-28 12:46 ` "Márk S. Zoltán"
0 siblings, 1 reply; 8+ messages in thread
From: Keiko Nakata @ 2007-12-28 10:02 UTC (permalink / raw)
To: zoltan.s.mark; +Cc: caml-list
Hello.
I have not thoroughly read your posts,
but it looks like the double vision problem.
You may be interested in the following post to camllist:
http://caml.inria.fr/pub/ml-archives/caml-list/2007/06/0d23465b5b04f72fedecdd3bbf2c9d72.en.html
By the way, I believe one can hack the OCaml type checker
to (mostly) avoid the problem by "strengthening"
the external signature of the recursively defined module
in the type environment. But this is not particularly beautiful
since a type constructor may escape its scope,
although the implementation is kept sound.
I would be very happy if you could give me a practical example
which suffers from the double vision problem.
A happy new year!
Keiko
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [Caml-list] Type visibility limitation in recursive modules
2007-12-28 10:02 ` Keiko Nakata
@ 2007-12-28 12:46 ` "Márk S. Zoltán"
2007-12-28 13:42 ` Keiko Nakata
0 siblings, 1 reply; 8+ messages in thread
From: "Márk S. Zoltán" @ 2007-12-28 12:46 UTC (permalink / raw)
To: Caml Mailing List
Keiko Nakata wrote:
> Hello.
>
> I have not thoroughly read your posts,
> but it looks like the double vision problem.
>
> You may be interested in the following post to camllist:
>
> http://caml.inria.fr/pub/ml-archives/caml-list/2007/06/0d23465b5b04f72fedecdd3bbf2c9d72.en.html
>
Thanks; somehow I haven't found that particular post when I was
searching for precedents. Not having managed to understand the double
vision problem, I still I happen to think it is not a good explanation
in this case for many reasons. Even Xavier agrees in the above linked
post that this is a bug of the current implementation, i.e. not a
theoretical issue. If it was due to some sort of theoretical difficulty,
variant types would fail at it too, but they don't. This is a plain bug
- the way type equations are stored is a kludge, as it uses the
type-manifest field which already plays an only distantly related role.
> By the way, I believe one can hack the OCaml type checker
> to (mostly) avoid the problem by "strengthening"
> the external signature of the recursively defined module
> in the type environment.
Something like this is actually done in the code - when typing the
structs, the encountered type declarations are strengthened to point to
themselves as seen from outside of the module. The problem is that this
is only doable for variant and record types due to legacy design
decisions in the actual implementation, namely that type abbreviations
are represented in the current implementation in a way which does not
allow storing the results of such strenghtening, and that the compiler
silently passes over a type it cannot strenghten this way, probably in
the hope of not causing hard-to-relate problems later. I really find it
hard to emphasize enough that this cannot be a theoretical impossibility
if it works for variant types.
Unless you are talking about some other kind of hack, in which case I'd
be very grateful to hear more details.
> But this is not particularly beautiful
> since a type constructor may escape its scope,
> although the implementation is kept sound
>
You mean like in http://caml.inria.fr/mantis/view.php?id=3993 ? The
situation is caught by the compiler, fortunately, so I don't feel
worried about it. I wonder, though, if there is any other way for this
escape to happen except the '_a-style delayed typing, like ref [], which
IMHO should be rejected as untypeable in the absence of explicit type
annotations, not courteously typed to '_a list.
> I would be very happy if you could give me a practical example
> which suffers from the double vision problem.
>
I could give you the skeleton of what I am trying to implement, which is a pattern matching & rewriting engine. I am not sure if it is a practical example of the double vision problem, since I don't know what it is.
module type NumbersS : sig ... end
module type AtomsS : sig ... end
module type ReprS :
sig
module Numbers : NumbersS
module Atoms : AtomsS
module rec Terms : sig type t = <variant type def here, using Seq.t> ... end
and Seq : sig type t ... end
end
module RepresentationF =
functor(N : NumbersS) ->
functor(A : AtomsS) ->
(struct
module Numbers = N
module Atoms = A
module rec Terms : sig ...<as above> end =
struct ...<stuff using Seq.t> ...end
and Seq : sig type t ... end =
struct type t = Terms.t list <or something awful like that>
...
end
end : ReprS with module Numbers = N and module Atoms = A)
module type RewriterS =
sig
module Numbers : NumbersS
module Atoms : AtomsS
module Representation : ReprS with module Atoms = Atoms and module Numbers = Numbers
...
end
module RewriterF =
functor(N : NumbersS) ->
functor(A : AtomsS) ->
functor(R : ReprS with module Atoms = A and module Numbers = N) ->
(struct
...
end : RewriterS with module Numbers = N and module Atoms = A and module Representation = R)
module MyNumbers : NumbersS = struct .. end
module rec MyAtoms : sig ... end <includes AtomsS and uses Driver.* stuff> =
struct ... end
and MyRepresentation : ReprS with module Numbers = MyNumbers and module Atoms = MyAtoms
= RepresentationF(MyNumbers)(MyAtoms)
and MyRewriter
: RewriterS with module Numbers = MyNumbers and module Atoms = MyAtoms and module Representation = MyRepresentation
= RewriterF(MyNumbers)(MyAtoms)(MyRepresentation)
and Driver : sig ... end = struct ... <stuff making use of all other modules> ... end
Basically, I want to
- separate the numeric part (plain OCaml numeric types vs. Bignum lib vs. ocamlgmp).
- postpone decisions about the actual structure of atoms (this code will be reused in many projects, and in some of them atoms may contain complex structures making use of Representation.Terms.t and Representation.Seq.t)
- isolate the implementation details of Seq from Terms (Seq.t: list? array? map?) and make sure Terms does not make assumptions about the Seq.t type (the struct of Terms is a huge mess of legacy code)
- keep the pattern-matching and rewriting bytecode engine isolated from the representation details
- keep the actual driver separate from the rewriter too (the same Rewriter could be used with different drivers, e.g. one with memoizing, one without, one with memoizing open to explicit insertion of memoized cases; the driver might make use of builtins for certain kinds of terms, sport a typing system or not - and it would probably store lots of such information in the atoms, not in maps from atoms to information).
Of course there are other, non-rec ways to get there, but I don't want to just yet. Apart from arriving to a more straightforward code using module rec, this is a hobby project, so I can afford to wait for a solution.
Cheers
Z-
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [Caml-list] Type visibility limitation in recursive modules
2007-12-28 12:46 ` "Márk S. Zoltán"
@ 2007-12-28 13:42 ` Keiko Nakata
0 siblings, 0 replies; 8+ messages in thread
From: Keiko Nakata @ 2007-12-28 13:42 UTC (permalink / raw)
To: zoltan.s.mark; +Cc: caml-list
> Unless you are talking about some other kind of hack, in which case I'd
> be very grateful to hear more details.
Probably I was talking about something else.
(Please bear in mind that I've got this way of understanding the type checker
via hacking the implementation; so it is likely that my understanding
is incorrect.)
Let me consider:
module rec M : sig type t val x: t end = struct type t = int let x = M.x end
The type checker type checks the body of M,
i.e., struct type t = int let x = M.x end, while the type environment records
M's signature as "sig type t val x: t end". But in fact one could perhaps
enrich the signature to "sig type t = t val x: t end",
where the right-hand side of the abbreviation t = t is the type t
in the structure, not in the signature.
As you may know, the type checker can distinguish the two types named t,
since they internally carry different identities.
Since the type environments used to type check the body of M
and the rest of the program are distinct, there is no possibility that
type abstraction can be broken.
This is not beautiful, since the type t in the right side of "t = t"
escapes from its scope.
I should note that this is not my original idea :-)
Thank you for the program. I will take it back home and look at it
during the coming new year holidays.
Best,
Keiko
^ permalink raw reply [flat|nested] 8+ messages in thread
end of thread, other threads:[~2007-12-28 13:42 UTC | newest]
Thread overview: 8+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2007-12-23 14:24 Type visibility limitation in recursive modules "Márk S. Zoltán"
2007-12-23 17:20 ` [Caml-list] " Peng Zang
2007-12-23 18:00 ` "Márk S. Zoltán"
2007-12-24 11:44 ` "Márk S. Zoltán"
2007-12-27 23:13 ` "Márk S. Zoltán"
2007-12-28 10:02 ` Keiko Nakata
2007-12-28 12:46 ` "Márk S. Zoltán"
2007-12-28 13:42 ` Keiko Nakata
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox