summaryrefslogtreecommitdiffstats
path: root/src/ceph/doc/cephfs/best-practices.rst
blob: 79c638eb39c6afee6979acf03654a4c9bde564dd (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
CephFS best practices
=====================

This guide provides recommendations for best results when deploying CephFS.

For the actual configuration guide for CephFS, please see the instructions
at :doc:`/cephfs/index`.

Which Ceph version?
-------------------

Use at least the Jewel (v10.2.0) release of Ceph.  This is the first
release to include stable CephFS code and fsck/repair tools.  Make sure
you are using the latest point release to get bug fixes.

Note that Ceph releases do not include a kernel, this is versioned
and released separately.  See below for guidance of choosing an
appropriate kernel version if you are using the kernel client
for CephFS.

Most stable configuration
-------------------------

Some features in CephFS are still experimental.  See
:doc:`/cephfs/experimental-features` for guidance on these.

For the best chance of a happy healthy filesystem, use a **single active MDS** 
and **do not use snapshots**.  Both of these are the default.

Note that creating multiple MDS daemons is fine, as these will simply be
used as standbys.  However, for best stability you should avoid
adjusting ``max_mds`` upwards, as this would cause multiple
daemons to be active at once.

Which client?
-------------

The fuse client is the easiest way to get up to date code, while
the kernel client will often give better performance.

The clients do not always provide equivalent functionality, for example
the fuse client supports client-enforced quotas while the kernel client
does not.

When encountering bugs or performance issues, it is often instructive to
try using the other client, in order to find out whether the bug was
client-specific or not (and then to let the developers know).

Which kernel version?
~~~~~~~~~~~~~~~~~~~~~

Because the kernel client is distributed as part of the linux kernel (not
as part of packaged ceph releases),
you will need to consider which kernel version to use on your client nodes.
Older kernels are known to include buggy ceph clients, and may not support
features that more recent Ceph clusters support.

Remember that the "latest" kernel in a stable linux distribution is likely
to be years behind the latest upstream linux kernel where Ceph development
takes place (including bug fixes).

As a rough guide, as of Ceph 10.x (Jewel), you should be using a least a
4.x kernel.  If you absolutely have to use an older kernel, you should use
the fuse client instead of the kernel client.

This advice does not apply if you are using a linux distribution that
includes CephFS support, as in this case the distributor will be responsible
for backporting fixes to their stable kernel: check with your vendor.

Reporting issues
----------------

If you have identified a specific issue, please report it with as much
information as possible.  Especially important information:

* Ceph versions installed on client and server
* Whether you are using the kernel or fuse client
* If you are using the kernel client, what kernel version?
* How many clients are in play, doing what kind of workload?
* If a system is 'stuck', is that affecting all clients or just one?
* Any ceph health messages
* Any backtraces in the ceph logs from crashes

If you are satisfied that you have found a bug, please file it on
http://tracker.ceph.com.  For more general queries please write
to the ceph-users mailing list.