Context-aware text filters (provide more meta information to the filter system) [#226963]

(completely rewritten summary by C_Logemann on November 27, 2012)

Introduction

The function called "checkup_markup" suggests that the drupal filter system is only for filtering out unwanted content. But it can and is being used for a kind of "constructive filtering" with changing and adding content.

Examples for "constructive filters"

The best examples for this are the core filters for changing linebreaks to html breaks or converting email and web addresses to html links.
Glossary modules (e.g. glossary), linkit, biblio, tableofcontents and footnotes are some examples where the filter system is used for power up content and realize knowledge management strategies.

The History of this feature request

At least the opening of the duplicate issue #8000: Filter contexts (and fix some small issues in filtering) by Gábor Hojtsy (in 2004) there is a need of meta information in the filter system with minimum the entity type and id like the first issue title already suggests: "Pass on node id to filter somehow".

Posted by Gábor Hojtsy on May 23, 2004 at 5:41pm

I have two custom modules now, which require the node id to be passed on to the filter process.
(removed old code and strategy suggestion, see old version of this issue description)

In 2008 this issue was opened by rötzi:

At the moment, a filter only gets a piece of text without any information about where this text comes from. Sometimes it can be useful to know where the text comes from to generate appropriate output. The example I have is the following:
You have your nodes grouped (e.g. by organic groups) and encounter a wikilink [[Page name]]. Now you would like to know in which group the current node is to link to the corresponding page in the same group.

I think it's time to get this solved in Drupal 8.

How more "meta information" can help?

Like Gabor I was at first dealing with node hooks to get meta Information to the text. In any case we have to deal with bad workarounds if we don't use the filter system. With "langcode" we got already a "meta information" to the filter system in Drupal 7 and more meta information can help much more.
Any information about the source of the text which has to be filtered can empower knowledge management filters a lot.
If a glossary or biblio filter can use meta information of the entity it can change behavior depending it is possibly a part of an organic group or it is connected to a special taxonomy vocabulary. Additional the filter system can "report" maybe to an own database table when the filter is used successfully to create statistics or provide internal backlinks to the entities.
If this is possible it would be nice to get information of the fields where the text is used with fieldname and delta information. In this way a filter could be used to operate across more than one field of an entity and generate for example unique IDs. The footnotes module is currently adding a random string to IDs which is a bad strategy if somebody wants to bookmark a footnote (#194558: Non-unique list item IDs).

strategies and suggestions

At least the patch of Gabor in 2004 provides the complete node object as an optional data to the filter. And also the the patches between 2008 and 2011 in this issue are following this idea (meantime with entities). And since more and more data becomes entities in drupal this is a good fundament for a lot of knowledge management empowerments for drupal through the filter system.

Maybe somebody have an idea how to provide fieldname and delta information?

There was an idea to change the function 'check_markup' to 'apply_filters' in this issue. This name change is not required for the functionality and this would be a bigger API change as it already is. Similar with the technology called "menu system" which is doing more than just dealing with menu items I think we can live with "check_markup".

Comment	File	Size	Author
#60	filter-context-226963-60.patch	37.64 KB	c-logemann
#59	filter-context-226963-59.patch	35 KB	c-logemann
#54	filter-context-226963-54.patch	34.94 KB	c-logemann
#46	filter-context-226963-46.patch	32.45 KB	cwgordon7
#43	filter-context-226963-43.patch	32.43 KB	cwgordon7
#42	filter-context-226963-42.patch	30.77 KB	cwgordon7
#38	filter-context-226963-38.patch	26.84 KB	cwgordon7
#34	filter-context-226963-34.patch	26.83 KB	cwgordon7
#32	filter_context_03.patch	27.6 KB	cwgordon7
#31	filter_context_02.patch	29.28 KB	cwgordon7
#17	filter_context_01.patch	14.56 KB	cwgordon7
	filters_with_context.patch	8.34 KB	rötzi

Comments

Comment #1

cwgordon7 commented 27 February 2008 at 18:13

Big +1! This would be awesome!!

Comment #2

Susurrus commented 2 March 2008 at 02:58

Status:

Active

» Needs review

I think makes a lot of sense. +1 here and a status update.

Comment #3

moshe weitzman commented 2 March 2008 at 12:51

Subscribe. I have felt the pain of missing context many times. +1.

Comment #4

wim leers

Ghent 🇧🇪🇪🇺

commented 3 March 2008 at 13:14

Subscribing.

Comment #5

chx commented 3 March 2008 at 14:24

I am inclined to won't fix. This is a task for nodeapi op view not filters. The very concept of filters is not being content aware. What will you do with something that's not a node...?

Comment #6

moshe weitzman commented 3 March 2008 at 14:31

The problem is that nodeapi view is not cached. We don't want to run nasty filter code on every request.

Comment #7

cwgordon7 commented 4 March 2008 at 01:10

I can see other use cases beyond nodes. For example, comments, block descriptions, etc., are all filtered; these could potentially all benefit from this.

And if this is node-specific, then we may need to reconsider how nodes are filtered by the filter system, because the current version just doesn't work.

Comment #8

chx commented 4 March 2008 at 03:30

Whatever. Feel free to ignore me on this.

Comment #9

cwgordon7 commented 10 May 2008 at 16:51

Status:

Needs review

» Needs work

patch no longer applies cleanly on HEAD.

Comment #10

sun

German

Karlsruhe

commented 22 June 2008 at 11:34

+1 on the feature.
-1 on renaming check_markup(), additional arguments are sufficient.
-1 on limiting this to nodes.

As outlined in Add context for check_markup() at g.d.o, check_markup() is used for blocks, comments, nodes, profile fields, and users in core currently (see actual occurrences over there).

The goal should be to pass any type of content (a string) along with the object itself (most often an object) to check_markup(). Leaving conditional code for handling a certain object to filters that care.

Comment #11

moshe weitzman commented 3 July 2008 at 21:43

I suppose that a module can use nodeapi('view') and do its own caching of the result. Same for non module operations. You don't *have* to use the filter system.

Comment #12

mariuss commented 3 July 2008 at 21:44

Here is an example where you need context, and it is not for a node but a taxonomy term:

The Glossary module uses taxonomies to store the glossary data. It provides a filter that will highlight and link words that are part of the glossary. Even glossary term descriptions have this filter applied. Ideally when looking at a glossary term you don't want that term to be highlighted in its own definition, it looks silly. But without some context it is impossible for the glossary filter to figure when words to ignore and when.

See: http://drupal.org/node/276198

Comment #13

sun

German

Karlsruhe

commented 3 July 2008 at 22:55

#8000: Filter contexts (and fix some small issues in filtering) has been marked as duplicate of this issue. Includes a patch as well.

Well, now that I recognize the issue number of that issue, I would rather want to mark this one duplicate... ;)

Comment #14

mfer commented 4 July 2008 at 15:23

Let's think about the scalability of this solution in the database. Currently, for every instance we have a database entry. If contexts were added in we would have to multiply that by the number of contexts for that instance in the most basic of cases. If two or more filters took advantage of contexts it would be a much larger growth rate than simple multiplication. This just doesn't seem scalable and could quickly grow out of control.

I can understand the issue of contexts. It's something I've personally struggled with.

What if, instead, we added a postprocess op to hook_filter (or a hook_filter_postprocess). When content is processed through filters we record the filters applied to it. When it's loaded we pass in the content to the postprocess op for the applied filters. This would let us do something similar to the hook_nodeapi view operation everywhere and let the filters define it for themselves. Thoughts?

Comment #15

sun

German

Karlsruhe

commented 2 August 2008 at 02:01

@mfer: There seems to be a misunderstanding - the goal of this issue is to pass information about the filtered text to all invoked filters. This does not change input formats or the number of filter per input format at all. But filters would finally be able to use the passed-in information intelligently, f.e. using $node->nid or $user->uid to render some markup that is tied to the object itself.

Steven also noted that filter caching could be a caveat, if some dumb filters try to use data outside of the cached content - however, for my use-case (Inline API) I just want to get some information passed in, so the contents of a filtered field are able to query some data and/or directly use values from the passed in object.

Comment #16

cwgordon7 commented 2 August 2008 at 03:03

Assigned:

Unassigned

» cwgordon7

Due to the comment by Steven, I feel I have no choice but to roll this patch to prove that I am not just subscribing and +1'ing and "fully expect[ing] others in the community to solve it, while I reap the benefits." :(

Patch later tonight. Not to be a reroll.

Comment #17

cwgordon7 commented 2 August 2008 at 05:15

Status:

Needs work

» Needs review

Status	File	Size
new	filter_context_01.patch	14.56 KB

This changes the filtering system to cache everything on a per-context basis.

Comment #18

mfer commented 2 August 2008 at 16:18

@cwgordon7, thanks for the patch. First, I like the idea of knowing more about the object in a filter. It's more a matter of making sure the solution doesn't break other parts of drupal.

One thing to note about this patch is that if in hook_nodeapi there is a conditional addition to the node object in the load operation there may be (at least) two different rows in the cache_filter table (which might, already, be the table with the most rows in drupals db). This wouldn't affect all conditional includes but it could for some. This leaves a window open for sloppy or unknowing developers to over fill the filter_cache table.

Is this an acceptable risk?

Comment #19

cwgordon7 commented 2 August 2008 at 18:09

@#18 - This is a definite and unavoidable side-effect, imo. By the very nature of being context-aware, if a node is loaded differently in two different requests, the filtered text itself may result in different text as well.

Another option would be to add a separate op to hook_nodeapi (case 'filter') in which only modules intending to use filtering on nodes with node contexts would add their stuff. I don't like that, however, as it doesn't really allow modules to use other modules' node properties effectively.

A third option would be to cache based on id (e.g. nid, cid, or bid). However, with this option, if a node or comment or block etc changes, the filter is still cached. I kind of like this option better; the only problem, really, is having to delete from the cache_filter table after node/comment/etc. updates where like '%%:%%:type:%d', and that's relatively easy to accomplish.

Other thoughts?

Comment #20

mfer commented 2 August 2008 at 19:32

Status:

Needs review

» Needs work

As a proof of the issue from #18, in comment_nodeapi a comment count is added to the node object. Each time there is a new comment added a new comment count will show up on the node and a new row will end up in cache_filter. On a large site like d.o this would be very bad.

If we attempt cache invalidation like the 3rd option in #19 what would the affects (performance) be and what would be the best way? Especially on a mysql myisam table with table level locking.

@cwgordon7 don't feel discouraged. This is a very tough problem to solve.

“There are only two hard things in Computer Science: cache invalidation and naming things” --Phil Karlton

Comment #21

cwgordon7 commented 2 August 2008 at 20:05

What if a filter relies on the comment count? :P

No, but the thing is, I agree with you. If something relies on the comment count, it can just case 'no cache': return TRUE;. Or we could add another hook_filter op, e.g. 'context dependent': return (bool);.

Thoughts?

Comment #22

nancydru

she/her/hers

English

Boston

commented 2 October 2008 at 19:07

+1 for the idea, and possibly for the rename.

BUTS:

I would rather see the context param moved to the end. By your own description, this is an optional parameter. There are going to be lots of changes in contribs just from the renaming; let's at least make them work with a global search-and-replace.
As Marius pointed out, I am very interested in filtering terms as well.
Perhaps it is unnecessary to mention this here, but filters do not operate on nodes; they operate on text. That text may come from nodes, but is not limited to that.
Can we not apply a cache lifetime to the filter cache to assist in limiting this table?

Comment #23

mfer commented 2 October 2008 at 19:55

@NancyDru Context aware filters are quite a complicated issue. The solution in the patch above needs some major rework.

Adding a cache lifetime isn't a solution to the problem of cache invalidation in this case. It's a kind of work around or hack that doesn't address how the system can get out of hand and where this is flawed. Even with a cache invalidation of 1 hour I can come up with cases that could kill the filter cache on a medium sized site. It's a design flaw. A simple modification of the anonymous filtering system won't work. If there was no caching this would work. The issue is the filter caching.

Think about it this way. If module loads (via hook_nodeapi) anything that's custom per user, time, or anything that changes regularly the caching system will blow up. The information stored in an object needs to stay static or relatively static. This is a major limiting factor.

If the caching system were out of the loop on some of this context information the issue wouldn't be there. Maybe the solution is run the filter system like it does now for caching. Then, after it loads a cached piece of information allow a hook or function to modify the filtered text. A kind of 2 stage filtering system where the context part of it is not cached. It would work similar to how hook_nodeapi with the view op is used now to alter text on nodes but it wouldn't be limited just nodes.

Does this make sense? Thoughts?

Comment #24

sun

German

Karlsruhe

commented 27 April 2009 at 19:12

Issue tags:

+FilterSystemRevamp

Comment #25

sun

German

Karlsruhe

commented 9 February 2010 at 15:14

Version:

7.x-dev

» 8.x-dev

Comment #26

dave reid

he/him

English

Nebraska USA

commented 7 March 2010 at 02:17

Issue tags:

+token

Tagging as this blocks creating an easy, simple token filter.
#141430: Merge Token and Rep[lacement]Tags module

Comment #27

sun

German

Karlsruhe

commented 7 March 2010 at 02:49

Issue tags:

-token

Reverting, #141430-10: Merge Token and Rep[lacement]Tags module

Comment #28

dave reid

he/him

English

Nebraska USA

commented 7 March 2010 at 03:02

Issue tags:

+token

WTF. It's a valid tag since it's blocking a potential token issue.

Comment #29

sun

German

Karlsruhe

commented 7 March 2010 at 03:25

To anyone concerned: You need to completely read and understand #8000: Filter contexts (and fix some small issues in filtering)

I've spent the last years tinkering about this challenge and as stated in that issue, it's one of the most critical problems for filter module authors, but we yet have to find a solid, and most importantly, reliable and secure solution.

Before attempting to write patches, we need very very creative and innovate ideas. If you have one, you can reach me anytime, everywhere.

Comment #30

xano

he/they

English

Southampton

commented 23 May 2010 at 23:07

Subscribing.

Haven't checked the patches yet, but one thing that got me stuck when writing a filter once, was that the filter doesn't know who's content it's filtering. Therefore my whole plan to convert /me is having lunch to *Xano is having lunch* didn't work out.

Comment #31

cwgordon7 commented 20 August 2010 at 04:32

Status:

Needs work

» Needs review

Status	File	Size
new	filter_context_02.patch	29.28 KB

Here's a thought: introduce a 'context callback' in hook_filter_info() that takes $type and $object as parameters, where $type is something like 'node', 'comment', or 'block', and $object is the $node, $comment, or $block object. The hook would then return an array of contextual data that it needs in order to filter. The caching is then handled based on changes in the contextual data: if the contextual data changes, the cache needs to be skipped. This effectively creates a new layer in which filters tell us when re-application of filters becomes necessary. Comes with a test to confirm that it's working as advertised. To confirm that this meets our needs, let's examine some use cases:

1. Converting /me is having lunch to *[Username] is having lunch*.

function example_filter_context_callback($text, $filter, $format, $type, $object) {
  switch ($type) {
    case 'node':
    case 'comment':
      $account = user_load($object->uid);
      // Filter will have to recache when the account name changes, either when the author of the comment/node changes or when the user changes their name.
      return array(
        'user_name' => $account->name,
      );
  }
}

2. Converting [[Image:1]] to the first image attached to the node.

function example_filter_context_callback($text, $filter, $format, $type, $object) {
  switch ($type) {
    case 'node':
      if (!empty($node->images)) {
        // Filter will have to recache when the images on a node change at all; i.e., they're added, removed, changed, or reordered.
        return array(
          'inline_images' => $node->images,
        );
      }
      break;
  }
}

3. Creating wikilinks based on organic group:

function example_filter_context_callback($text, $filter, $format, $type, $object) {
  switch ($type) {
    case 'comment':
      $object = node_load($object->nid);
    case 'node':
      if (!empty($object->og) && $group = node_load($object->og)) {
        // Filter will have to recache when the node's organic group changes (or is removed, or added, etc.), and also when the organic group's title changes.
        return array(
          'og_nid' => $group->nid,
          'og_title' => $group->title,
        );
      }
  }
}

4. Pirate filter needs to know if it's September 19th or not.

function example_filter_context_callback($text, $filter, $format, $type, $object) {
  // Caching is invalidated when the date changes to September 19th (pirate turns on), and also when the date changes to September 20th (pirate turns off).
  return array(
    'pirate_day' => (date('md') == '0919'),
  );
}

Note: these are all just code samples, they may not be correct in the particulars but the ideas behind them should be clear.

Even though this won't go in until 8.x, please let me know what you think!

Thanks!
Charlie

Comment #32

cwgordon7 commented 22 December 2010 at 21:15

Status	File	Size
new	filter_context_03.patch	27.6 KB

Rerolled the patch to keep up with HEAD.

Comment #33

sreynen commented 4 January 2011 at 15:47

Subscribing.

Comment #34

cwgordon7 commented 24 May 2011 at 20:17

Status	File	Size
new	filter-context-226963-34.patch	26.83 KB

Patch no longer applied, rerolled.

I'd appreciate some reviews on this, please. Thanks!

Comment #35

24 May 2011 at 21:02

Status:

Needs review

» Needs work

The last submitted patch, filter-context-226963-34.patch, failed testing.

Comment #36

sun

German

Karlsruhe

commented 24 May 2011 at 21:14

Thanks for keeping this rollin', @cwgordon7.

However, unless someone beats me to it, I need to find some time to do an in-depth review of #8000: Filter contexts (and fix some small issues in filtering) and related/duplicate issues, especially with regard to @unconed's strong objections (the original author of Drupal's text format and filter system), and pull out + summarize all of that into a separate, public post somewhere, possibly g.d.o. This will have to catch all and possibly hidden details and has to be 100% unbiased.

Without that, this issue won't go anywhere.

So, to clarify: We don't need further patches or patch reviews. This issue attempts to invalidate and ignore the objections of the person who originally wrote and designed the API you are trying to change. That API hasn't changed since it was invented, so it's not clear why we believe that all of those objections would suddenly not exist anymore. We need an absolutely rock solid and informed base, which allows us to prove that this change can be done and none of @unconed's counter-arguments still stand.

Comment #37

dave reid

he/him

English

Nebraska USA

commented 24 May 2011 at 21:27

Based on my quick read, it seemed that unconed's primary objection was that he wanted to solve the larger, overall context problem and not implementing something in the filter system which could then be adapted to a larger context system (*ahem*...Butler). In addition, it's SEVEN years after Steven's concerns about this. Drupal has changed a lot (yes, including the filter API), mostly for the better. We have tokens in core and this issue would help pave the way for a token filter. I don't think we should have to 100% disprove everything Steven said in order for this to fly.

Comment #38

cwgordon7 commented 25 May 2011 at 03:12

Status	File	Size
new	filter-context-226963-38.patch	26.84 KB

Steven's objections:

- Filters are used for more than just nodes (e.g. comments, but also profile fields and such). Any filter functionality which requires node access will have a high hack-factor IMO and would probably benefit from more extensive filter-system modifications. I know this is a rather annoying, but persistent fact of the filter system.

This system takes this into account and does not special case nodes, in fact blocks, which are not entities, can be used as contextual information as well. The 'type' parameter is passed in.

- Just giving access to the node object doesn't achieve much IMO: the filtering module has no ownership over the node, so any advanced filtering functionality would have to be implemented in odd and non- or badly configurable ways. At most, some extra node info can be read out, but IMO your mentioned use for this patch is rather unorthodox.

I have trouble understanding what this means. Perhaps this doesn't apply anymore, but made sense at the time the comment was written? If someone can clarify, please do so. In any case, we allow modules to provide context callbacks to extract the pieces of content they're going to need.

The biggest problems I see with the filter system is that it is 'stupid'. It treats anonymous pieces of text. Your patch clearly addresses this by giving contextual info by passing the node along. However, many other filter ideas require other kinds of contextual info, such as admins being able to bypass HTML tag limitations (user/role info), allowing user-chosen format options such as HTML vs Plain-text (text origin metadata) and such.

IMHO we shouldn't be considering a user's roles while filtering. Users should either have access to a filter, or not. Keeping the system simple is the way to minimize the potential for security holes. I'm not sure what was meant by "allowing user-chosen format options"; if someone can clarify, again, please do so.

Still, IMO we need to work out a proper context system for situations like this: people often want access to the current node from a block or from the theme.

Yes, it would be awesome if we had some sort of global contextual system (i.e. butler) but here we are 7 years later and we don't. This is a tangible, achievable feature and I don't think it should be forced to wait on a global contextual system; if and when such a system were introduced, it does not look like there would become any harder to change the filtering system over to such a system with or without this patch.

Plus, what about filtering comments? Maybe we need to pass the $comment object along, so filters can operate on that. And shouldn't we then also include the $node that the comment is replying to (remember, node_load is a db query)? The list goes on

This problem is solved by allowing modules to specify the contextual data they want in the context callback. If they need a comment's node, they can pass it along.

I've thought about defining what the filter system should and should not handle. I don't think it should handle only transformations unknown to the user, but I do think that it should only occupy itself with transforming the actual content that was typed: adding links, glossary tips, paragraph tags, smileys, etc. These all replace what the user typed with a more extensive /equivalent/.

I think it's a perfectly legitimate use case of a filtering system to know some contextual data. Filters do this all the time - the pirate filter, for example uses the contextual data of whether or not it's September 19th to change the way it filters. A filter that replaces "~~~" with the author's username and link to the author's page would be relying on contextual information, but would be perfectly legitimate.

I don't think this is all of Steven's objections, but that's at least most of what was on the other issue sun linked to.

This patch should fix at least some of the failures from earlier.

Comment #39

cwgordon7 commented 25 May 2011 at 03:20

Status:

Needs work

» Needs review

Comment #40

25 May 2011 at 04:02

Status:

Needs review

» Needs work

The last submitted patch, filter-context-226963-38.patch, failed testing.

Comment #41

mfer commented 25 May 2011 at 13:36

I've got 2 concerns / questions about this system:

The context variable passed into each of the filters has the total context for all the filters. It's not a setting/info for the filter but a context for all filters being applied to the text. Should it function this way or allow for each filter to build its own context?
This system opens a potential to blow up the filter caching system. For example, fields cache the text of body on a node. What about the case where a filter adds the node object to the context on a site where nodes get a fair number of comments. For every variation in the node object (not the fields value) there will be a new row in the filter cache. This would be nice for the case were you use a part of the node object on the filter. But, it makes keeping the cache clean difficult and creates a high potential to make this table full of not only a lot of rows but many that aren't being used.
Right now when text is updated the cache key for the filter is the same so it replaces the old cached row with a new one. With this change, and figuring a node entity is attached, the updated value would change on the node. This would change the cache key. So, the old cached row would remain and a new one would be added.

For sites with much activity there is quite a potential for this setup to cause the filter table to grow in size quite a bit. Imagine a site like Drupal.org and the number of comments on a node. We would have to be very careful about filter selection and the contexts included to make sure the cache_filter table/bin doesn't grow to many times the size it is now.

I'm not opposed to the change. We just need to make warning signs so people know they are jumping off a cliff and for the lemmings that follow that first person to also know there is a cliff ahead.

There are only two hard things in Computer Science: cache invalidation and naming things.
-- Phil Karlton

Comment #42

cwgordon7 commented 25 May 2011 at 17:58

Status:

Needs work

» Needs review

Status	File	Size
new	filter-context-226963-42.patch	30.77 KB

To address mfer's questions / concerns:

1. That's an interesting question. I'm honestly not sure which is better, but I did it this way because I felt there was more harm in providing too little information to the hook than providing too much. In addition, the langcode is added to the context, which all filters may need to check to function properly, and is logically grouped with contextual data. I'm open to changing this, though; it's not very hard to switch the way this works.

2. Good point. I've added a cache_clear_all() call to check_markup so that no more than one entry will ever exist in the cache_filter bin for any given $text. That is, if the contextual data changes, the cached filter entry will be cleared for that text, and replaced by a new one for the new contextual data.

This patch will hopefully pass tests.

Comment #43

cwgordon7 commented 25 May 2011 at 18:20

Status	File	Size
new	filter-context-226963-43.patch	32.43 KB

Forgot to add files.

Comment #44

moshe weitzman commented 25 May 2011 at 18:25

FWIW, the filter cache was nearly removed at #569238: Make check_markup() not cache by default and probably will be removed in D8. The idea is that Field API does sufficient caching of its own.

Comment #45

25 May 2011 at 19:12

Status:

Needs review

» Needs work

The last submitted patch, filter-context-226963-43.patch, failed testing.

Comment #46

cwgordon7 commented 25 May 2011 at 20:35

Status:

Needs work

» Needs review

Status	File	Size
new	filter-context-226963-46.patch	32.45 KB

One more try.

Comment #47

aaron commented 2 September 2011 at 19:58

subscribing. this would make #1268116: WYSIWYG does not record file usage a piece of cake.

Comment #48

c-logemann

he/him

German

Frankfurt/M, Germany

commented 14 November 2012 at 10:47

Status:

Needs review

» Closed (duplicate)

Just found this issue and see that my issue (#1830958: Provide more meta information to the filter system) is duplicate. But there is an very old issue of 2004 which was marked as duplicate of this.
I think it's important to focus on the old issue to make clear how old this feature request really is.
So I mark this issue as duplicate and will update the information in the old issue:
#8000: Filter contexts (and fix some small issues in filtering)

Comment #49

sun

German

Karlsruhe

commented 14 November 2012 at 21:36

@C_Logemann: I appreciate and actually can get behind your thinking, but re-opening a years-old, outdated, and stale issue, and subsequently marking the much more recent and active issue as duplicate of the former (which has been marked as duplicate of this issue in the first place), certainly is a terrible start and way to introduce yourself here.

I don't care for which issue it is, but I made sure to carefully think through this over the past ~3 years.

The latest patch in here seems to be on the right track. There's only a small but highly important detail from my perspective:

What we need to do is to replace the bool $cache argument of check_markup() with a bool|array value — negating its current meaning/behavior; i.e., if an empty context array is passed, the filtered text may be cached, but if a non-empty context array is passed, the filter cache is force-disabled.

Furthermore, the passed $context array should share the semantics and structure of the Tokens API — but without actually using it and indicating that any usage of it would be assumed or encouraged. Tokens are conceptually and architecturally NOT designed to participate in the filter system. However, the structure, definition, and lazy-instantiation of tokens in the token system IS compatible and beneficial, so this aspect of the concept should be borrowed and taken over; even if there's no API to back it yet (but tha can follow later).

Any takers? Although I'd love to, I doubt I'll be able to work on this until feature freeze, but I'd love to see this happening for D8. I'll definitely spare time for reviews and guidance.

Comment #50

c-logemann

he/him

German

Frankfurt/M, Germany

commented 14 November 2012 at 22:56

@sun: I think the discussion between 2008 and 2011 in this issue was following the same ideas like the old issue in 2004. When I switched the duplicate status in the issues I was thinking about the history and a focus on an very long needed feature. I think it doesn't matter where the discussion and the patches are "living". I can move code from here to there or my new rewritten issue summary from #8000: Filter contexts (and fix some small issues in filtering) to here. When you think it's better to switch back the duplicate logic I will fix this.
In any case I had the plan to build on the suggestions and the work which was already done in this issue. I will work on this the next days and your guidance is very welcome.

Comment #51

mstrelan commented 15 November 2012 at 01:47

Title:

Context-aware filters

» Context-aware text filters

I keep seeing notifications about this and thinking it's to do with Views or similar, so renaming to "text filters" to clarify that.

Comment #52

c-logemann

he/him

German

Frankfurt/M, Germany

commented 27 November 2012 at 16:49

Title:	Context-aware text filters	» Context-aware text filters (provide more meta information to the filter system)
Assigned:	cwgordon7	» c-logemann
Priority:	Normal	» Major
Status:	Closed (duplicate)	» Active

Ok. The most important thing is to move on solving this issues.
I am already working on "porting" the #46 patch to actual drupal 8 system.
I've closed the old issue and will copy my issue summary to this place.

Because of its real age with 8 years I mark this feature request as "major" and think about "critical" (if this is suitable for feature requests).

Comment #53

xano

he/they

English

Southampton

commented 27 November 2012 at 19:44

Furthermore, the passed $context array should share the semantics and structure of the Tokens API — but without actually using it and indicating that any usage of it would be assumed or encouraged. Tokens are conceptually and architecturally NOT designed to participate in the filter system. However, the structure, definition, and lazy-instantiation of tokens in the token system IS compatible and beneficial, so this aspect of the concept should be borrowed and taken over; even if there's no API to back it yet (but tha can follow later).

Shouldn't we start abstracting the token API rather than copy the relevant parts and worry about abstraction later? Unless I'm missing something, we can reuse most of the token API (the context-awareness, for instance). Tokens really are a kind of context-aware input filters, aren't they? It's the chaining we can't reuse and that is token-specific.

Comment #54

c-logemann

he/him

German

Frankfurt/M, Germany

commented 28 November 2012 at 23:39

Status:	Active	» Needs review
Issue tags:		+Needs documentation, +API addition

Status	File	Size
new	filter-context-226963-54.patch	34.94 KB

Just done the "porting" of #46 patch.

@sun:
Why should we force disable the cache when a context is available?
But if this is not an performance issue we can do this.
Should this be forced with logic in the check_markup function or just when it's called with 'cache' => FALSE. Currently I've copied all old 'cache' => TRUE of #46.
For me the most important thing is to get entity information to the filter system.
I am not very familiar with token API. Can you provide a link or give an example how to organize the $context array?

Comment #55

28 November 2012 at 23:45

Status:	Needs review	» Needs work
Issue tags:	-Needs documentation, -token, -FilterSystemRevamp, -API addition

The last submitted patch, filter-context-226963-54.patch, failed testing.

Comment #56

c-logemann

he/him

German

Frankfurt/M, Germany

commented 28 November 2012 at 23:59

Status:

Needs work

» Needs review

#54: filter-context-226963-54.patch queued for re-testing.

Comment #57

29 November 2012 at 07:32

Status:	Needs review	» Needs work
Issue tags:		+Needs documentation, +token, +FilterSystemRevamp, +API addition

The last submitted patch, filter-context-226963-54.patch, failed testing.

Comment #58

c-logemann

he/him

German

Frankfurt/M, Germany

commented 29 November 2012 at 10:54

Ok, the second test error messages are much more helpful than the first "Detect a Drupal installation failure". I will try to fix this.

In the next step I try to introduce "fieldname" and "fielddelta" as context to fill if available.

In this way and for other possible addons to the context it would be better to provide a context array to the context callback function.

With trying a small custom filter in a custom module I realized that filters can get namespace conflicts in defining context addons.
There should be module name prefixes for that. Maybe a filter author can work with contexts of of others and define the related modules as dependency.

On my first test without a context callback I missed the entity and saw that I need to add this by myself.
Isn't it better to provide the object and type as standard context for filters to avoid multiple definitions of this context?

Comment #59

c-logemann

he/him

German

Frankfurt/M, Germany

commented 29 November 2012 at 22:59

Status:

Needs work

» Needs review

Status	File	Size
new	filter-context-226963-59.patch	35 KB

For this error I have only a workaround:

An AJAX HTTP error occurred. HTTP Result Code: 200 Debugging information follows. Path: /batch?id=2&op=do_nojs&op=do StatusText: OK ResponseText: Fatal error: Call to undefined function cache_clear_all() in /d8/core/modules/filter/filter.module on line 920

Here is line 918-922

   if ($cache) {
    // Avoid flooding the cache_filter table with outdated entries.
    cache_clear_all($format->format . ':' . hash('sha256', $text) . ':', 'cache_filter', TRUE);
    cache('filter')->set($cache_id, $text, CacheBackendInterface::CACHE_PERMANENT, array('filter_format' => $format->format));
  }

I just commented out line 920. Does somebody else knows better to fix this cache clear code of #46 patch?

Comment #60

c-logemann

he/him

German

Frankfurt/M, Germany

commented 30 November 2012 at 15:51

Status	File	Size
new	filter-context-226963-60.patch	37.64 KB

I've introduced fieldname and fielddelta. I've currently problems to get this into the Formatters:
/modules/field/modules/text/lib/Drupal/text/Plugin/field/formatter/TextTrimmedFormatter.php
/modules/field/modules/text/lib/Drupal/text/Plugin/field/formatter/TextDefaultFormatter.php

To handle fieldname and fielddate I changed the API in _text_sanitize() and the context callback.

Comment #61

kerasai commented 24 May 2013 at 08:31

Issue tags:

-Needs documentation, -token, -FilterSystemRevamp, -API addition

#60: filter-context-226963-60.patch queued for re-testing.

Comment #62

24 May 2013 at 08:34

Status:	Needs review	» Needs work
Issue tags:		+Needs documentation, +token, +FilterSystemRevamp, +API addition

The last submitted patch, filter-context-226963-60.patch, failed testing.

Comment #63

c-logemann

he/him

German

Frankfurt/M, Germany

commented 24 May 2013 at 09:21

There was a lot of code change in D8 since building the last patch so I am not wondering about the failed testing. Some additional work has to be done to get this patch woking again.

Without a clear decision by the core maintainers about this feature request I don't spend more time in this. Maybe I try to get my "knowledge management" features running on the level of "entity load". But this would be not the best solution in my opinion.

Comment #64

ronliskey commented 26 June 2013 at 17:28

Status:	Needs work	» Needs review
Issue tags:	-Needs documentation, -token, -FilterSystemRevamp, -API addition

#60: filter-context-226963-60.patch queued for re-testing.

Comment #65

26 June 2013 at 17:29

Status:	Needs review	» Needs work
Issue tags:		+Needs documentation, +token, +FilterSystemRevamp, +API addition

The last submitted patch, filter-context-226963-60.patch, failed testing.

Comment #66

kbentham commented 15 August 2013 at 16:23

This needs to be rerolled. There are 21 conflicts so someone with advanced experience with how the views module works needs to reroll this.

Comment #67

chx commented 15 August 2013 at 17:56

Version:	8.x-dev	» 9.x-dev
Priority:	Major	» Normal

Sorry but feature requests go into Drupal 9 now.

Comment #68

c-logemann

he/him

German

Frankfurt/M, Germany

commented 15 August 2013 at 21:57

Assigned:

c-logemann

» Unassigned

I hope there will come any clear decision by the core maintainers what the way should be.

Comment #69

moshe weitzman commented 15 August 2013 at 22:06

I think filter caching goes away when we have render caching of entities (still planned for d8). filter cache almost got removed from d7 so i hope we remove it from d8. then, this issue becomes mostly moot.

Comment #69.0

moshe weitzman commented 15 August 2013 at 22:06

Issue summary:

View changes

Complete rewritten issue summary

Comment #70

wim leers

Ghent 🇧🇪🇪🇺

commented 11 June 2014 at 20:40

As of #2217877: Text filters should be able to add #attached, #post_render_cache, and cache tags, filter caching is gone. See https://drupal.org/node/2278483 for a high-level explanation, see the issue for details.

However, I'm not sure I understand #69 — I don't see how removing filter caching helps to pass more metadata to the filter system. It makes it easier to pass more metadata to the filter system, because the filtered text is now cached as part of the cached version of the rendered entity instead of the filtered text being cached "globally per language", but that's it.

Comment #71

catch

he/him

English

commented 5 November 2014 at 15:51

Version:	9.x-dev	» 8.1.x-dev
Category:	Feature request	» Task

Just an API addition so could go into a minor version.

Comment #72

5 November 2014 at 15:51

Version:

8.1.x-dev

» 8.2.x-dev

Drupal 8.1.0-beta1 was released on March 2, 2016, which means new developments and disruptive changes should now be targeted against the 8.2.x-dev branch. For more information see the Drupal 8 minor version schedule and the Allowed changes during the Drupal 8 release cycle.

Comment #73

5 November 2014 at 15:51

Version:

8.2.x-dev

» 8.3.x-dev

Drupal 8.2.0-beta1 was released on August 3, 2016, which means new developments and disruptive changes should now be targeted against the 8.3.x-dev branch. For more information see the Drupal 8 minor version schedule and the Allowed changes during the Drupal 8 release cycle.

Comment #74

5 November 2014 at 15:51

Version:

8.3.x-dev

» 8.4.x-dev

Drupal 8.3.0-alpha1 will be released the week of January 30, 2017, which means new developments and disruptive changes should now be targeted against the 8.4.x-dev branch. For more information see the Drupal 8 minor version schedule and the Allowed changes during the Drupal 8 release cycle.

Comment #75

5 November 2014 at 15:51

Version:

8.4.x-dev

» 8.5.x-dev

Drupal 8.4.0-alpha1 will be released the week of July 31, 2017, which means new developments and disruptive changes should now be targeted against the 8.5.x-dev branch. For more information see the Drupal 8 minor version schedule and the Allowed changes during the Drupal 8 release cycle.

Comment #76

5 November 2014 at 15:51

Version:

8.5.x-dev

» 8.6.x-dev

Drupal 8.5.0-alpha1 will be released the week of January 17, 2018, which means new developments and disruptive changes should now be targeted against the 8.6.x-dev branch. For more information see the Drupal 8 minor version schedule and the Allowed changes during the Drupal 8 release cycle.

Comment #77

5 November 2014 at 15:51

Version:

8.6.x-dev

» 8.7.x-dev

Drupal 8.6.0-alpha1 will be released the week of July 16, 2018, which means new developments and disruptive changes should now be targeted against the 8.7.x-dev branch. For more information see the Drupal 8 minor version schedule and the Allowed changes during the Drupal 8 release cycle.

Comment #78

5 November 2014 at 15:51

Version:

8.7.x-dev

» 8.8.x-dev

Drupal 8.7.0-alpha1 will be released the week of March 11, 2019, which means new developments and disruptive changes should now be targeted against the 8.8.x-dev branch. For more information see the Drupal 8 minor version schedule and the Allowed changes during the Drupal 8 release cycle.

Comment #79

rgpublic

German

Düsseldorf 🇩🇪 🇪🇺

commented 24 April 2019 at 13:59

Long time no see :-) This issue is getting quite old, but I still think this is very badly needed. The main problem, of course, is caching. I think we shouldn't just assume anything (and in consequence either have lots of duplicated keys in the cache or on the other end of the spectrum actually different stuff overwrite the same cache keys). Instead, IMHO we should *ask* the filter plugin about necessary cache keys or provide a way to respond back on which cache keys it depends. Or - the other way round - we could provide the context information via a separate object and as soon as a filter plugin calls a method/queries a property of that object, this would be considered a dependent cache key.

Comment #80

24 April 2019 at 13:59

Version:

8.8.x-dev

» 8.9.x-dev

Drupal 8.8.0-alpha1 will be released the week of October 14th, 2019, which means new developments and disruptive changes should now be targeted against the 8.9.x-dev branch. (Any changes to 8.9.x will also be committed to 9.0.x in preparation for Drupal 9’s release, but some changes like significant feature additions will be deferred to 9.1.x.). For more information see the Drupal 8 and 9 minor version schedule and the Allowed changes during the Drupal 8 and 9 release cycles.

Comment #81

24 April 2019 at 13:59

Version:

8.9.x-dev

» 9.1.x-dev

Drupal 8.9.0-beta1 was released on March 20, 2020. 8.9.x is the final, long-term support (LTS) minor release of Drupal 8, which means new developments and disruptive changes should now be targeted against the 9.1.x-dev branch. For more information see the Drupal 8 and 9 minor version schedule and the Allowed changes during the Drupal 8 and 9 release cycles.

Comment #82

andypost

he/him

Russian

commented 4 August 2020 at 22:41

Issue tags:

+blocker, +Needs issue summary update

11 files were hidden/shown/deleted

Status	File	Size
hidden	filters_with_context.patch	8.34 KB
hidden	filter_context_01.patch	14.56 KB
hidden	filter_context_02.patch	29.28 KB
hidden	filter_context_03.patch	27.6 KB
hidden	filter-context-226963-34.patch	26.83 KB
hidden	filter-context-226963-38.patch	26.84 KB
hidden	filter-context-226963-42.patch	30.77 KB
hidden	filter-context-226963-43.patch	32.43 KB
hidden	filter-context-226963-46.patch	32.45 KB
hidden	filter-context-226963-54.patch	34.94 KB
hidden	filter-context-226963-59.patch	35 KB

Ref #2552497-18: [PP-1] Convert search_excerpt() to a filter

Last patch needs re-work from ground up

Comment #83

4 August 2020 at 22:41

Version:

9.1.x-dev

» 9.2.x-dev

Drupal 9.1.0-alpha1 will be released the week of October 19, 2020, which means new developments and disruptive changes should now be targeted for the 9.2.x-dev branch. For more information see the Drupal 9 minor version schedule and the Allowed changes during the Drupal 9 release cycle.

Comment #84

4 August 2020 at 22:41

Version:

9.2.x-dev

» 9.3.x-dev

Drupal 9.2.0-alpha1 will be released the week of May 3, 2021, which means new developments and disruptive changes should now be targeted for the 9.3.x-dev branch. For more information see the Drupal core minor version schedule and the Allowed changes during the Drupal core release cycle.

Comment #85

4 August 2020 at 22:41

Version:

9.3.x-dev

» 9.4.x-dev

Drupal 9.3.0-rc1 was released on November 26, 2021, which means new developments and disruptive changes should now be targeted for the 9.4.x-dev branch. For more information see the Drupal core minor version schedule and the Allowed changes during the Drupal core release cycle.

Comment #86

mxh commented 22 December 2021 at 12:35

Sorry for being OT, but maybe this can be helpful: I'm working on a Context Stack, which provides current, parent and root context from entity view and current account scope. Its API takes care of collecting cacheability metadata when added as cacheable dependency within any plugin. Feel free to try this one out.

Comment #87

22 December 2021 at 12:35

Version:

9.4.x-dev

» 9.5.x-dev

Drupal 9.4.0-alpha1 was released on May 6, 2022, which means new developments and disruptive changes should now be targeted for the 9.5.x-dev branch. For more information see the Drupal core minor version schedule and the Allowed changes during the Drupal core release cycle.

Comment #88

22 December 2021 at 12:35

Version:

9.5.x-dev

» 10.1.x-dev

Drupal 9.5.0-beta2 and Drupal 10.0.0-beta2 were released on September 29, 2022, which means new developments and disruptive changes should now be targeted for the 10.1.x-dev branch. For more information see the Drupal core minor version schedule and the Allowed changes during the Drupal core release cycle.

Comment #89

22 December 2021 at 12:35

Version:

10.1.x-dev

» 11.x-dev

Drupal core is moving towards using a “main” branch. As an interim step, a new 11.x branch has been opened, as Drupal.org infrastructure cannot currently fully support a branch named main. New developments and disruptive changes should now be targeted for the 11.x branch, which currently accepts only minor-version allowed changes. For more information, see the Drupal core minor version schedule and the Allowed changes during the Drupal core release cycle.

Comment #90

22 December 2021 at 12:35

Version:

11.x-dev

» main

Drupal core is now using the main branch as the primary development branch. New developments and disruptive changes should now be targeted to the main branch.